<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://microformats.org/wiki/index.php?action=history&amp;feed=atom&amp;title=robots-exclusion-brainstorming</id>
	<title>robots-exclusion-brainstorming - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://microformats.org/wiki/index.php?action=history&amp;feed=atom&amp;title=robots-exclusion-brainstorming"/>
	<link rel="alternate" type="text/html" href="https://microformats.org/wiki/index.php?title=robots-exclusion-brainstorming&amp;action=history"/>
	<updated>2026-04-26T22:55:36Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.38.4</generator>
	<entry>
		<id>https://microformats.org/wiki/index.php?title=robots-exclusion-brainstorming&amp;diff=33751&amp;oldid=prev</id>
		<title>Tantek: drafted, moved a proposal from robots-exclusion-issues to here.</title>
		<link rel="alternate" type="text/html" href="https://microformats.org/wiki/index.php?title=robots-exclusion-brainstorming&amp;diff=33751&amp;oldid=prev"/>
		<updated>2007-11-13T19:33:46Z</updated>

		<summary type="html">&lt;p&gt;drafted, moved a proposal from &lt;a href=&quot;/wiki/robots-exclusion-issues&quot; title=&quot;robots-exclusion-issues&quot;&gt;robots-exclusion-issues&lt;/a&gt; to here.&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&amp;lt;h1&amp;gt; robots exclusion brainstorming &amp;lt;/h1&amp;gt;&lt;br /&gt;
{{TOC-right}}&lt;br /&gt;
This page contains brainstorming, thoughts, and proposals for extending the [[robots-exclusion]] microformat.&lt;br /&gt;
&lt;br /&gt;
== specific user agents ==&lt;br /&gt;
[[robots-exclusion]] lacks the ability to allow control of specific UAs à la [http://www.robotstxt.org/wc/norobots.html A Standard for Robot Exclusion].  While this is out of scope currently since [[meta robots]] has no specific control of UAs either, here are some thoughts that have been proposed:&lt;br /&gt;
&lt;br /&gt;
If it is actually necessary to control specific UAs here is an possible soluiton.&lt;br /&gt;
Example:&lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&amp;lt;nowiki&amp;gt;&lt;br /&gt;
&amp;lt;!DOCTYPE html PUBLIC &amp;quot;-//W3C//DTD XHTML 1.0 Strict//EN&amp;quot; &amp;quot;http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;html&amp;gt;&lt;br /&gt;
&amp;lt;head&amp;gt;&lt;br /&gt;
&amp;lt;link rel=&amp;quot;schema.RobotExclusion&amp;quot; href=&amp;quot;http://example.org/.../&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;meta name=&amp;quot;RobotExclusion.RobotName1&amp;quot; content=&amp;quot;Foo Bot&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;meta name=&amp;quot;RobotExclusion.RobotName2&amp;quot; content=&amp;quot;Bar Bot&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;meta name=&amp;quot;RobotExclusion.RobotName3&amp;quot; content=&amp;quot;Evil Bot&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/head&amp;gt;&lt;br /&gt;
&amp;lt;body&amp;gt;&lt;br /&gt;
&amp;lt;h1&amp;gt;Page&amp;lt;/h1&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;robots-noindex&amp;quot;&amp;gt;This paragraph shouldn't be indexed by any bot.&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;robot3-noindex&amp;quot;&amp;gt;This paragraph should be indexed by every bot except &amp;quot;Evil Bot&amp;quot;.&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;robots-noindex robot1-index&amp;quot;&amp;gt;This paragraph should only be indexed by &amp;quot;Foo Bot&amp;quot;.&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/body&amp;gt;&lt;br /&gt;
&amp;lt;/html&amp;gt;&lt;br /&gt;
&amp;lt;/nowiki&amp;gt;&amp;lt;/pre&amp;gt;&lt;br /&gt;
Of course it is a waste of bandwith if there are &amp;quot;RobotExclusion.RobotName&amp;quot; meta tags&lt;br /&gt;
on every page of a website. Thus this metatags should be stored on one page - perhaps the&lt;br /&gt;
main page - so they can be maintained easily. &lt;br /&gt;
&lt;br /&gt;
&amp;lt;pre&amp;gt;&amp;lt;nowiki&amp;gt;&lt;br /&gt;
&amp;lt;!DOCTYPE html PUBLIC &amp;quot;-//W3C//DTD XHTML 1.0 Strict//EN&amp;quot; &amp;quot;http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd&amp;quot;&amp;gt;&lt;br /&gt;
&amp;lt;html&amp;gt;&lt;br /&gt;
&amp;lt;head&amp;gt;&lt;br /&gt;
&amp;lt;link rel=&amp;quot;schema.RobotExclusion&amp;quot; href=&amp;quot;http://example.org/.../&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;link rel=&amp;quot;RobotExclusion.Names&amp;quot; href=&amp;quot;http://mypage.com/&amp;quot; /&amp;gt;&lt;br /&gt;
&amp;lt;/head&amp;gt;&lt;br /&gt;
&amp;lt;body&amp;gt;&lt;br /&gt;
&amp;lt;h1&amp;gt;Page&amp;lt;/h1&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;robots-noindex&amp;quot;&amp;gt;This paragraph shouldn't be indexed by any bot.&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;robot3-noindex&amp;quot;&amp;gt;This paragraph should be indexed by every bot except &amp;quot;Evil Bot&amp;quot;.&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;p class=&amp;quot;robots-noindex robot1-index&amp;quot;&amp;gt;This paragraph should only be indexed by &amp;quot;Foo Bot&amp;quot;.&amp;lt;/p&amp;gt;&lt;br /&gt;
&amp;lt;/div&amp;gt;&lt;br /&gt;
&amp;lt;/body&amp;gt;&lt;br /&gt;
&amp;lt;/html&amp;gt;&lt;br /&gt;
&amp;lt;/nowiki&amp;gt;&amp;lt;/pre&amp;gt;&lt;br /&gt;
&lt;br /&gt;
=== problems ===&lt;br /&gt;
* use of the &amp;lt;code&amp;gt;meta&amp;lt;/code&amp;gt; element violates the visibility [[principle]] of microformats.&lt;br /&gt;
* encoding content such as &amp;quot;robot1&amp;quot; or &amp;quot;robot3&amp;quot; into class values violates the principle of not putting content into the class attribute.&lt;br /&gt;
&lt;br /&gt;
=== poll ===&lt;br /&gt;
* -1 [[User:Tantek|Tantek]] due to the above problems, and the fact that we have yet to see much adoption of [[robots-exclusion]] in the first place, I think the proposal is both premature and flawed.&lt;br /&gt;
&lt;br /&gt;
== Habakuk extensions ==&lt;br /&gt;
* As I read this, I had the idea to use this microformat to differentiate the real content of a webpage from the rest (navigation, header, footer, ...) - you could do this by marking the &amp;quot;real content&amp;quot; with the tag &amp;quot;index&amp;quot;, but thats not really clear. Maybe you could create a new tag to mark the really important things on the page (the &amp;quot;real content&amp;quot;) from the rest. --[[User:Habakuk|Habakuk]] 03:42, 14 Jan 2007 (PST)&lt;br /&gt;
* And another idea is to mark an area of a page as independent from the rest (p.e. for listings of softwaretools - if i search for an software that can do ''a'' and ''b'' i don't want to get a result that offers me a software that can do ''a'' and another that can do ''b''). --[[User:Habakuk|Habakuk]] 03:42, 14 Jan 2007 (PST)&lt;br /&gt;
&lt;br /&gt;
== related pages ==&lt;br /&gt;
* [[robots-exclusion]]&lt;br /&gt;
* [[robots-exclusion-issues]]&lt;/div&gt;</summary>
		<author><name>Tantek</name></author>
	</entry>
</feed>