<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>API Programming &#187; Google</title>
	<atom:link href="http://www.brangle.com/wordpress/tag/google/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.brangle.com/wordpress</link>
	<description>Just another computer weblog</description>
	<lastBuildDate>Wed, 14 Dec 2011 07:35:21 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Preventing Google and Other Search Engines From Indexing Your Website Using Meta Tags</title>
		<link>http://www.brangle.com/wordpress/2009/08/blocking-google-and-other-search-engines-from-indexing-using-meta-tags/</link>
		<comments>http://www.brangle.com/wordpress/2009/08/blocking-google-and-other-search-engines-from-indexing-using-meta-tags/#comments</comments>
		<pubDate>Sun, 16 Aug 2009 08:00:20 +0000</pubDate>
		<dc:creator>lopezg</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[HTML]]></category>

		<guid isPermaLink="false">http://www.brangle.com/wordpress/?p=274</guid>
		<description><![CDATA[Not everything you post on the internet needs to be crawled, indexed, and cached by Google or any other search engines. For that very reason a robots.txt file was created, but sometimes we don't need to get to granular in the robots.txt file, or we may not have access to edit it. For that very reason we can specific information to our META tags in our web pages. The robots field is a comma separated list, if you do not set it, it will automatically default to ALL, this means that the page can be indexed and that all links on the page can be crawled. Here is the api for the robots field... [...]]]></description>
			<content:encoded><![CDATA[<p>Not everything you post on the internet needs to be crawled, indexed, and cached by Google or any other search engines.  For that very reason a robots.txt file was created, but sometimes we don&#8217;t need to get to granular in the robots.txt file, or we may not have access to edit it.  For that very reason we can specific information to our META tags in our web pages.  The robots field is a comma separated list, if you do not set it, it will automatically default to ALL, this means that the page can be indexed and that all links on the page can be crawled. Here is the api for the robots field&#8230;<span id="more-274"></span></p>
<pre>CONTENT="ALL | NONE | NOINDEX | INDEX| NOFOLLOW | FOLLOW | NOARCHIVE"
default = "ALL"
"NONE" = "NOINDEX, NOFOLLOW"</pre>
<ul>
<li>ALL &#8211; Robots are allowed to index, follow links, and archive the page</p>
<li>NONE &#8211; Robots should ignore this page, i.e. act as if this page doesn&#8217;t exist and as if they never saw it.
<li>NOINDEX &#8211; do not index this particular page
<li>INDEX &#8211; Robots should index this page
<li>NOFOLLOW &#8211; Robots can index this page, but should not follow any links on this page
<li>FOLLOW &#8211; Robots can follow any links on this page
<li>NOARCHIVE &#8211; This is a special meta field used by Google which prevents it from being archived</ul>
<p>Following the above mapping, I tried creating an example that would fit most users necessities.</p>
<ol>
<li>index this page, and any pages that I link to</li>
<li>index this page, but don&#8217;t crawl any links referenced here</li>
<li>don&#8217;t index this page, but crawl any links referenced on this page</li>
<li>don&#8217;t index this page, and don&#8217;t crawl any links referenced on this page</li>
<li>In addition to the above, we can prevent this page from being archived from Google</li>
</ol>
<p>Example #1, index this page, and any pages that it links to</p>

<div class="wp_syntax"><table><tr><td class="line_numbers"><pre>1
2
3
4
</pre></td><td class="code"><pre class="html4strict" style="font-family:monospace;"><span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">html</span>&gt;</span>
   <span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">head</span>&gt;</span>
      <span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">meta</span> <span style="color: #000066;">name</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;ROBOTS&quot;</span> <span style="color: #000066;">content</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;ALL&quot;</span>&gt;</span>
   <span style="color: #009900;">&lt;<span style="color: #66cc66;">/</span><span style="color: #000000; font-weight: bold;">head</span>&gt;</span></pre></td></tr></table></div>

<p>Example #2, Index this page, but don&#8217;t follow any links</p>

<div class="wp_syntax"><table><tr><td class="line_numbers"><pre>1
2
3
4
</pre></td><td class="code"><pre class="html4strict" style="font-family:monospace;"><span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">html</span>&gt;</span>
   <span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">head</span>&gt;</span>
      <span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">meta</span> <span style="color: #000066;">name</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;ROBOTS&quot;</span> <span style="color: #000066;">content</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;INDEX, NOFOLLOW&quot;</span>&gt;</span>
   <span style="color: #009900;">&lt;<span style="color: #66cc66;">/</span><span style="color: #000000; font-weight: bold;">head</span>&gt;</span></pre></td></tr></table></div>

<p>Example #3, Don&#8217;t index this page, but follow links on this page</p>

<div class="wp_syntax"><table><tr><td class="line_numbers"><pre>1
2
3
4
</pre></td><td class="code"><pre class="html4strict" style="font-family:monospace;"><span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">html</span>&gt;</span>
   <span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">head</span>&gt;</span>
      <span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">meta</span> <span style="color: #000066;">name</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;ROBOTS&quot;</span> <span style="color: #000066;">content</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;NOINDEX, FOLLOW&quot;</span>&gt;</span>
   <span style="color: #009900;">&lt;<span style="color: #66cc66;">/</span><span style="color: #000000; font-weight: bold;">head</span>&gt;</span></pre></td></tr></table></div>

<p>Example #4, Don&#8217;t index this page, and don&#8217;t follow any links</p>

<div class="wp_syntax"><table><tr><td class="line_numbers"><pre>1
2
3
4
</pre></td><td class="code"><pre class="html4strict" style="font-family:monospace;"><span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">html</span>&gt;</span>
   <span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">head</span>&gt;</span>
      <span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">meta</span> <span style="color: #000066;">name</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;ROBOTS&quot;</span> <span style="color: #000066;">content</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;NOINDEX, NOFOLLOW&quot;</span>&gt;</span>
   <span style="color: #009900;">&lt;<span style="color: #66cc66;">/</span><span style="color: #000000; font-weight: bold;">head</span>&gt;</span></pre></td></tr></table></div>

<p>Example #5, Lastly, assuming we do not want this page cached we can add a NOARCHIVE tag, the following example will allow the page to be indexed, all links can be crawled, but the page cannot be archived</p>

<div class="wp_syntax"><table><tr><td class="line_numbers"><pre>1
2
3
4
</pre></td><td class="code"><pre class="html4strict" style="font-family:monospace;"><span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">html</span>&gt;</span>
   <span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">head</span>&gt;</span>
      <span style="color: #009900;">&lt;<span style="color: #000000; font-weight: bold;">meta</span> <span style="color: #000066;">name</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;ROBOTS&quot;</span> <span style="color: #000066;">content</span><span style="color: #66cc66;">=</span><span style="color: #ff0000;">&quot;INDEX, FOLLOW, NOARCHIVE&quot;</span>&gt;</span>
   <span style="color: #009900;">&lt;<span style="color: #66cc66;">/</span><span style="color: #000000; font-weight: bold;">head</span>&gt;</span></pre></td></tr></table></div>

<p>You can add the NOARCHIVE field to any example above to prevent it from being cached on search engines. It&#8217;s important to remember, that not being cached does not mean that your web page will not be indexed on a search engine.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.brangle.com/wordpress/2009/08/blocking-google-and-other-search-engines-from-indexing-using-meta-tags/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
	</channel>
</rss>

