From elli at sustainlane.com Tue Jun 2 00:01:16 2009 From: elli at sustainlane.com (Elli Albek) Date: Tue Jun 2 00:01:28 2009 Subject: [uf-discuss] Microformats and keyword spamming Message-ID: <9FADEAEF483F4DCA877446C3A9ADA2A4@ealbek> Problems again. Building a simple yelp style business review page requires us to repeat a lot of information on the page. This has a few results: 1. Non semantic HTML. The pages include a lot of repeating terms in the wrong places. There is no way to avoid it if we want to use hreview. 2. The pages become less accessible, since the HTML starts deviating from its semantic form. For example, include pattern requires: a. object tags (this should ALWAYS be avoided at all cost!!!) b. empty A tags C. A tags with redundant information that is constantly repeating on the page. This is what we currently do as the lesser of all evils. 3. Needing to repeat so much text on the page affects search engine results: Repeating terms that are almost irrelevant to the page, like business on each and every hreview. 4. Repeating important keywords on the page TOO MANY TIMES, such as the business name on every review: Maharishi Ayurveda Health Spa This added the page main keywords so many times that I suspect it borders keyword spamming in the eyes of the search engine. WHAT TO DO: Those are the options now: 1. The risk of a good content site becoming a keyword spammer, outweighs any benefit of using microformats that I can currently see. Making the pages non semantic in order to support microformats, possibly already hurt our rankings. Solution: remove microformats from reviews pages entirely. 2. Use Google's cut down version of microformats. This may not follow the spec, but if we follow google most of our problems are solved. What I like about many google APIs is their practical approach. In that case I think their view of microformats is more practical then the spec. It certainly solves a lot of our problems. 3. Alternative formats: RDF? 4. Direct feed to search engines in proprietary formats. We will still support hcard for the business directory, but will remove support for hreview since this is the major source of problems. This all started when we wanted to added review aggregate, and the pages are starting to deteriorate. Advice is welcome and appreciated. I would really appreciate: ** An example that shows how to build a REAL reviews page. ** 1. That page includes the business information once and only once on the page, where the business name is in H1 (in hcard). 2. That page includes review aggregate, which does not require any repetition or hidden text. 3. A few reviews of the said business, WHICH DO NOT REQUIRE ANY REPETITION of any item information, the type of business, etc. 4. Business name shows once and only once on the page in the hcard and nowhere else. 5. Listing type (business/product) shows once and only once on the page. Natural place in the hcard, but according to the spec it is not possible. E From lists at ben-ward.co.uk Tue Jun 2 01:42:14 2009 From: lists at ben-ward.co.uk (Ben Ward) Date: Tue Jun 2 01:42:22 2009 Subject: [uf-discuss] Microformats and keyword spamming In-Reply-To: <9FADEAEF483F4DCA877446C3A9ADA2A4@ealbek> References: <9FADEAEF483F4DCA877446C3A9ADA2A4@ealbek> Message-ID: <285AB944-3564-4E3D-B994-AC0563F13B42@ben-ward.co.uk> Hi Elli, On 2 Jun 2009, at 00:01, Elli Albek wrote: > 1. Non semantic HTML. The pages include a lot of repeating terms in > the > wrong places. There is no way to avoid it if we want to use hreview. Repeated content is of course undesirable, but please don't conflate it with being ?non semantic?. > 2. The pages become less accessible, since the HTML starts deviating > from > its semantic form. For example, include pattern requires: The include pattern offers you choices to reference other content in the page. It's not perfect, I agree. I would appreciate some clarity on your problems below, though: > a. object tags (this should ALWAYS be avoided at all cost!!!) We are aware of and have documented scenarios where `object` is problematic, but that doesn't equate to ?ALWAYS be avoided at all cost?. If you have a new case with the `object` version is as horrific as you make out, please document it on http://microformats.org/wiki/include-pattern-issues . If not, please keep your descriptions concise and free from overreaction, as it confuses attempts to assist you. > b. empty A tags These are pretty much outlawed by the spec now, based on previous accessibility research, > C. A tags with redundant information that is constantly repeating on > the > page. This is what we currently do as the lesser of all evils. This is a compromise, of course. Is it really evil though? Suboptimal, sure, but evil? You'd be repeating the name of the restaurant somewhere. It can be fit into the structure cleanly, and it can be hidden if you want.
? etc.
Yes, not perfect. But can be designed in cleanly. > 3. Needing to repeat so much text on the page affects search engine > results: > > Repeating terms that are almost irrelevant to the page, like > business > on each and every hreview. The review 'type' is optional. If it doesn't fit your publishing pattern, just leave it out. > 4. Repeating important keywords on the page TOO MANY TIMES, such as > the > business name on every review: > > Maharishi Ayurveda Health > Spa > > > This added the page main keywords so many times that I suspect it > borders > keyword spamming in the eyes of the search engine. Can you provide more info than that you ?suspect?? Obviously that's a serious issue if true, but these hyperlinks are linking to fragments within the same page, not to other resources. To the best of my knowledge, that has nothing to do with establishing keywords for a page. Is there documentation to the contrary? > 2. Use Google's cut down version of microformats. This may not > follow the > spec, but if we follow google most of our problems are solved. What > I like > about many google APIs is their practical approach. In that case I > think > their view of microformats is more practical then the spec. It > certainly > solves a lot of our problems. Again, can you provide a link to this? Google's Rich Snippets documentation provides some small examples of hReview, but links to the microformats spec as the definitive reference. I'm unaware of any ?cut down? version of microformat specs from Google, nor can I see anything in their examples suggesting this. > 4. Direct feed to search engines in proprietary formats. We will still > support hcard for the business directory, but will remove support for > hreview since this is the major source of problems. Since search engines don't currently accept any alternative ?direct feed? format I don't quite know what option 4 is supposed to entail, and again, precise clarity of major problems makes the issue easier to work with. > Advice is welcome and appreciated. > > I would really appreciate: > > ** An example that shows how to build a REAL reviews page. ** > > 1. That page includes the business information once and only once on > the > page, where the business name is in H1 (in hcard). > 2. That page includes review aggregate, which does not require any > repetition or hidden text. > 3. A few reviews of the said business, WHICH DO NOT REQUIRE ANY > REPETITION > of any item information, the type of business, etc. > 4. Business name shows once and only once on the page in the hcard and > nowhere else. > 5. Listing type (business/product) shows once and only once on the > page. > Natural place in the hcard, but according to the spec it is not > possible. I'm now a little confused. Up to now you're talking about hreview, and now you refer to hreview-aggregate. As far as I'm aware, the `hreview-aggregate` can be created without any repetition or hidden text. The item will be a child of the aggregate. The subsequent reviews would need to use the include pattern to reference the original item to avoid major repetition. Currently, there is no other way. Now, I've asked questions here because you've been quite imprecise with some of your complaints, and I need to be clear about where you've found an explicit, documentable problem and where you're just expressing frustration. Please don't get me wrong, I completely agree that the include pattern is imperfect. And I do think better structural parsing should be pursued. But, with regard to helping your problem right now, that means being clear about problems you have with the current spec. If you can follow up more precisely to the above and provide more info where necessary, hopefully we can nail down a solution for you in the shorter term. Please provide example mark-up if you can (add it to the wiki if you think it illustrates an issue, too). Thanks, Ben From mail at tobyinkster.co.uk Tue Jun 2 03:36:56 2009 From: mail at tobyinkster.co.uk (Toby Inkster) Date: Tue Jun 2 03:37:01 2009 Subject: [uf-discuss] Microformats and keyword spamming In-Reply-To: <9FADEAEF483F4DCA877446C3A9ADA2A4@ealbek> References: <9FADEAEF483F4DCA877446C3A9ADA2A4@ealbek> Message-ID: <1243939016.12193.14.camel@ophelia2.g5n.co.uk> On Tue, 2009-06-02 at 00:01 -0700, Elli Albek wrote: > 3. Alternative formats: RDF? RDFa is pretty good at avoiding repetition. e.g.I like cheese.
If you like cheese (the food), then (a little more complicated):I like cheese
However, for a polling site, neither votelinks nor the example above are appropriate. You're not trying to capture the meaning that "this page is about something I like" - you're trying to capture the meaning of "this link is for voting". I don't know of a microformat, or indeed an RDF vocabulary, that covers that meaning, but I can certainly see value in creating one. -- Toby Inkster> I like cheese. >
> > If you like cheese (the food), then (a little more complicated): > >?xmlns:like="http://ontologi.es/like#" > xmlns:foaf="http://xmlns.com/foaf/0.1/"> > I like > > property="foaf:name">cheese > >
Strangely, no "I don't like" in that ontology? I see Opinion could be used for it, but... > However, for a polling site, neither votelinks nor the example above are > appropriate. You're not trying to capture the meaning that "this page is > about something I like" - you're trying to capture the meaning of "this > link is for voting". I don't know of a microformat, or indeed an RDF > vocabulary, that covers that meaning, but I can certainly see value in > creating one. > Thanks for the clarification. Since their meaning is unclear (as you and the FAQ state), why not let them be used for this purpose as well? I suppose with a class or another attribute together one could separate what is a poll link (vote for this) and what is an opinion (I have voted for this). Otherwise, if vote-links are not used for polling and there is not (it seems) a microformat for it, how to propose a new one? It would be nice to have a more general microformat for associating a target with a "label", more or less like xfn. Again, the problem of "acting on" vs "having acted on" that confused me shows up. How about something like this rough draft to follow. For acting on (polling): For having acted on (similar to vote-links)I think cheese is always blue
>> ? ? ? ? ?I like cheese. >> ? ? ? ?
> ? ? ? ? ? ?xmlns:like="http://ontologi.es/like#" >> ? ? ? ? ? xmlns:foaf="http://xmlns.com/foaf/0.1/"> >> ? ? ? ? ?I like >> ? ? ? ? ? >> ? ? ? ? ? ?> ? ? ? ? ? ? ? property="foaf:name">cheese >> ? ? ? ? ? >> ? ? ? ?
I think cheese is always blue >
I think cheese is always blue >>
Part of A Inc.
I tried to find something for this, but all I found was XFN which is just for human beings. Maybe a solution can be based upon it? Regards, Mirko From tantek at cs.stanford.edu Mon Jun 15 14:50:04 2009 From: tantek at cs.stanford.edu (=?UTF-8?Q?Tantek_=C3=87elik?=) Date: Mon Jun 15 14:50:08 2009 Subject: [uf-discuss] Wanted: a pattern indicating a subsidiary-parent-company relationship In-Reply-To:> Part of A Inc. >
Could you provide a URL to the website for company B? [1] > I tried to find something for this, but all I found was XFN which is > just for human beings. Maybe a solution can be based upon it? I've captured this area as "business to business" relationships on the XFN brainstorming page, along with POSH suggestions: http://microformats.org/wiki/xfn-brainstorming#business_to_business Thanks, Tantek [1] http://microformats.org/wiki/mailing-lists#Use_real_world_examples -- http://tantek.com/ From mirko.gustony at gmail.com Tue Jun 16 00:42:55 2009 From: mirko.gustony at gmail.com (Mirko Gustony) Date: Tue Jun 16 00:43:00 2009 Subject: [uf-discuss] Wanted: a pattern indicating a subsidiary-parent-company relationship In-Reply-To: <60cb038a0906151450s451a91f9wd6a6f15c2be8c39f@mail.gmail.com> References:>> Part of A Inc. >>
> > Could you provide a URL to the website for company B? [1] Unfortunately not for this particular use case. I am currently in the process of redesigning a website where the requirement is to have a text like the one above on every page in the header. > I've captured this area as "business to business" relationships on the > XFN brainstorming page, along with POSH suggestions: > > http://microformats.org/wiki/xfn-brainstorming#business_to_business Thanks for pointing me there. I immediately stopped reading after the stop words "human relations" :) IMHO rel="controlling" seems to be a bit misleading to me. Using my previous example:Part of A Inc.
I would understand (from the definition of rel-attribute [1]) that B Ltd. is controlling A Inc. but it should the other way round (or do I get it wrong?). So a better wording should be used (maybe "parentgroup" or something like that). Whereas rel="subsidiary" should be quite clear. Anyway: A real world example could be [2] where the corporate group links to its subsidiaries, and the subsidiaries will link back indicating the reverse relation (this looks like a lot of javascript, but you will get the idea). Another example could be [3] for the same use case. There might be another use case when companies are accredited dealers of another company (and the other way round) and want to show this using a microformat. An example could be [4]. > Thanks, > > Tantek > Regards, Mirko [1] "This attribute describes the relationship from the current document to the anchor specified by the href attribute."| Kevin Bacon |
| Kevin Bacon | >
| Kevin Bacon | >>