<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Design Gala &#187; robots txt</title>
	<atom:link href="http://designgala.com/tag/robots-txt/feed/" rel="self" type="application/rss+xml" />
	<link>http://designgala.com</link>
	<description>Web Usability, Web Technology, User Experience</description>
	<lastBuildDate>Wed, 02 May 2012 03:55:38 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>Reduce your wordpress bandwidth and increase pagerank</title>
		<link>http://designgala.com/reduce-your-wordpress-bandwidth-and-increase-pagerank/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=reduce-your-wordpress-bandwidth-and-increase-pagerank</link>
		<comments>http://designgala.com/reduce-your-wordpress-bandwidth-and-increase-pagerank/#comments</comments>
		<pubDate>Fri, 01 Jan 2010 20:12:03 +0000</pubDate>
		<dc:creator>Mukesh Chapagain</dc:creator>
				<category><![CDATA[WordPress]]></category>
		<category><![CDATA[bot]]></category>
		<category><![CDATA[crawler]]></category>
		<category><![CDATA[robots txt]]></category>

		<guid isPermaLink="false">http://www.designgala.com/?p=922</guid>
		<description><![CDATA[Bandwidth and PageRank are both important for any website. Bandwidth might not matter much if you are hosting with unlimited bandwidth, but it&#8217;s a major issue when you have limited bandwidth and...]]></description>
			<content:encoded><![CDATA[<p><img src="http://www.designgala.com/wp-content/uploads/2010/01/bandwidth.jpg" alt="" title="" width="450" height="300" class="alignright size-full wp-image-926" /></p>
<div style="clear:both"></div>
<p>Bandwidth and PageRank are both important for any website. Bandwidth might not matter much if you are hosting with unlimited bandwidth, but it&#8217;s a major issue when you have limited bandwidth and your website is getting more hits day by day :) Google PageRank is always an important consideration for any website.</p>
<p>Your site is also accessed by the bots/crawlers of Yahoo, Google, MSN, etc., and they use your site&#8217;s bandwidth too. You can considerably reduce your site&#8217;s bandwidth usage by limiting the bots&#8217; access to your pages. Bots will crawl your WordPress admin pages as well, which is futile because nobody wants their admin pages to show up in search results.</p>
<p>You can restrict bots from crawling these pages by adding meta robots tags or by creating a robots.txt file in your site&#8217;s root. For a WordPress site you need not worry about doing this by hand, because there is an excellent plugin called <a title="Robots Meta" href="http://wordpress.org/extend/plugins/robots-meta/" target="_blank">Robots Meta</a> that adds meta robots tags to WordPress pages. Just install and configure it.</p>
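<p>As a rough illustration of the robots.txt approach, the sketch below uses Python&#8217;s standard <code>urllib.robotparser</code> module to check what a set of rules would block. The rules, the <code>/wp-admin/</code> and <code>/wp-login.php</code> paths, the <code>example.com</code> URLs and the <code>is_allowed</code> helper are all assumptions made for this example; they are not taken from the plugin.</p>

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt rules; the paths are assumed for illustration only.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /wp-login.php
"""


def is_allowed(robots_txt, user_agent, url):
    """Return True if user_agent may fetch url under the given rules."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Admin pages are blocked for every bot, ordinary posts stay crawlable.
admin_ok = is_allowed(ROBOTS_TXT, "Googlebot", "http://example.com/wp-admin/options.php")
post_ok = is_allowed(ROBOTS_TXT, "Googlebot", "http://example.com/some-post/")
print("admin page allowed:", admin_ok)
print("post allowed:", post_ok)
```

<p>Checking rules this way before uploading the file can catch a pattern that blocks more, or less, than intended. Note that robots.txt only asks well-behaved bots to stay away; it does not protect the pages.</p>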
<p><a title="Robots Meta" href="http://wordpress.org/extend/plugins/robots-meta/" target="_blank">Robots Meta</a> provides a lot of options. Some of them are listed below:</p>
<p>- Prevent indexing of your search result pages, while still allowing the search engines to follow the links on them, by adding noindex,follow robots meta tags.<br />
- Disallow indexing of subpages to your homepage, category pages, author pages and tag pages, to prevent duplicate content.<br />
- Prevent indexing of your login, register and admin pages by adding noindex robots meta tags.<br />
- Add noodp and noydir meta robots tags, allowing you to opt out of DMOZ and Yahoo! Directory descriptions.<br />
- Prevent Yahoo! and Google from indexing your feeds by adding a meta tag to their head-section.<br />
- Prevent indexing of just your comment feeds.<br />
- Disable author and date-based archives.<br />
- Prevent attachment pages from ranking in the search results over your articles.<br />
- Edit your .htaccess and your robots.txt from within WordPress.<br />
- Assign robots meta tags to individual posts &#038; pages.</p>
]]></content:encoded>
			<wfw:commentRss>http://designgala.com/reduce-your-wordpress-bandwidth-and-increase-pagerank/feed/</wfw:commentRss>
		<slash:comments>13</slash:comments>
		</item>
		<item>
		<title>Making robots.txt File</title>
		<link>http://designgala.com/making-robotstxt-file/?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=making-robotstxt-file</link>
		<comments>http://designgala.com/making-robotstxt-file/#comments</comments>
		<pubDate>Wed, 05 Apr 2006 10:53:57 +0000</pubDate>
		<dc:creator>Design Gala</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[robots txt]]></category>

		<guid isPermaLink="false">http://www.designgala.com/?p=11</guid>
		<description><![CDATA[The robots.txt file is a text file that gives search engine robots specific instructions about content they are not allowed to crawl. These instructions tell search engines which pages of a website may be crawled.]]></description>
			<content:encoded><![CDATA[<p><span class="firstletter"><span>T</span></span>he robots.txt file is a text file that gives search engine robots specific instructions about content they are not allowed to crawl. These instructions tell search engines which pages of a website may be crawled. The address of the robots.txt file is: www.yoursitename.com/robots.txt.</p>
<p>By default, a well-behaved robot first looks for the robots.txt file and then follows its rules when crawling the site&#8217;s content.<br />
Every robots.txt file must contain two fields: User-agent and Disallow.</p>
<p class="quote_1">
User-agent: *<br />
Disallow:
</p>
<p>The User-agent field names the robot to which the access policy in the following Disallow field applies. The Disallow field specifies the URLs that the named robots may not access. An empty Disallow value, as in the example above, blocks nothing.</p>
<p class="quote_1">
User-agent: *<br />
Disallow: /
</p>
<p>Here &#8220;*&#8221; means all robots and &#8220;/&#8221; means all URLs. This is read as, &#8220;No access for any search engine to any URL&#8221;. Since every URL path begins with &#8220;/&#8221;, a Disallow value of just &#8220;/&#8221; bans access to all URLs. If partial access is to be given, only the banned URLs are specified in the Disallow field.</p>
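<p>The difference between an empty Disallow value and a Disallow value of &#8220;/&#8221; can be checked with a small sketch using Python&#8217;s standard <code>urllib.robotparser</code> module. The <code>can_crawl</code> helper, the <code>ExampleBot</code> user agent and the page URL are hypothetical names chosen for this example.</p>

```python
from urllib.robotparser import RobotFileParser

# The two rule sets discussed in the text, one rule per list item.
ALLOW_ALL = ["User-agent: *", "Disallow:"]
BLOCK_ALL = ["User-agent: *", "Disallow: /"]


def can_crawl(rules, url, user_agent="ExampleBot"):
    """Parse robots.txt lines and report whether user_agent may fetch url."""
    parser = RobotFileParser()
    parser.parse(rules)
    return parser.can_fetch(user_agent, url)


# Assumed URL, used only to exercise the rules.
url = "http://www.yoursitename.com/any/page.html"
allow_all_result = can_crawl(ALLOW_ALL, url)  # empty Disallow blocks nothing
block_all_result = can_crawl(BLOCK_ALL, url)  # "/" is a prefix of every path
print(allow_all_result, block_all_result)
```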
<blockquote><p>Let&#8217;s consider this example:<br />
# access for MSNbot.<br />
User-agent: MSNbot<br />
Disallow:<br />
User-agent: *<br />
Disallow: /category/php/</p></blockquote>
<p>Here both fields are repeated. Separate rules can be given for different user agents, each on its own lines. The rules above mean that all user agents are denied access to /category/php/ except MSNbot, which has full access. Characters following # are ignored up to the end of the line, as they are treated as comments.</p>
<p>Make sure that the robots.txt file is always named in all lowercase (e.g. Robots.txt or robots.Txt is incorrect).</p>
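<p>The per-agent rules in the example can be verified with a short sketch built on Python&#8217;s standard <code>urllib.robotparser</code> module. The rules are copied from the example; the <code>allowed</code> helper, the <code>Googlebot</code> comparison agent, the <code>/category/css/</code> path and the site URL are assumptions added for illustration.</p>

```python
from urllib.robotparser import RobotFileParser

# Rules copied from the example; the comment line is ignored by the parser.
RULES = """\
# access for MSNbot.
User-agent: MSNbot
Disallow:
User-agent: *
Disallow: /category/php/
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())

SITE = "http://www.yoursitename.com"  # assumed site address


def allowed(user_agent, path):
    """Return True if user_agent may fetch the given path on SITE."""
    return parser.can_fetch(user_agent, SITE + path)


msnbot_php = allowed("MSNbot", "/category/php/")    # MSNbot has full access
other_php = allowed("Googlebot", "/category/php/")  # falls under the * group
other_css = allowed("Googlebot", "/category/css/")  # path is not banned
print(msnbot_php, other_php, other_css)
```

<p>A robot uses the group that names it if there is one and falls back to the &#8220;*&#8221; group otherwise, which is why MSNbot is not affected by the second Disallow line.</p>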
]]></content:encoded>
			<wfw:commentRss>http://designgala.com/making-robotstxt-file/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
	</channel>
</rss>

