<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Edward Rayne &#187; Tutorials</title>
	<atom:link href="http://erayne.com/tag/tutorials/feed/" rel="self" type="application/rss+xml" />
	<link>http://erayne.com</link>
	<description>Web Development</description>
	<lastBuildDate>Wed, 18 Jan 2012 05:41:55 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Preventing Search Engine Access with /robots.txt</title>
		<link>http://erayne.com/blog/147/</link>
		<comments>http://erayne.com/blog/147/#comments</comments>
		<pubDate>Wed, 09 Dec 2009 00:18:06 +0000</pubDate>
		<dc:creator>Bill Ludwig</dc:creator>
				<category><![CDATA[Our Blog]]></category>
		<category><![CDATA[Best Practices]]></category>
		<category><![CDATA[Tutorials]]></category>
		<category><![CDATA[Website Development]]></category>

		<guid isPermaLink="false">http://erayne.com/?p=147</guid>
		<description><![CDATA[The /robots.txt file is simply a text file on your root directory that tells all &#8220;good&#8221; robots (the ones that aren&#8217;t fishing for email address of other spammy pursuits) what parts of your site to not index. By default all content that is accessible to a search engine spider is considered fair game. Because of [...]]]></description>
			<content:encoded><![CDATA[<p>The /robots.txt file is simply a text file on your root directory that tells all &#8220;good&#8221; robots (the ones that aren&#8217;t fishing for email address of other spammy pursuits) what parts of your site to not index. By default all content that is accessible to a search engine spider is considered fair game. Because of this, a /robots.txt file is only needed when you want to keep the search engines out of a particular directory.</p>
<p>It should also be noted that it is optional for search engines to follow the directive of the /roots.txt file. Although all major search engines tend to comply with the /robots.txt file there are plenty of spammy spiders that won&#8217;t. For that reason the ony sure fire way to protect content is to place password protection on the sensitive areas.</p>
<p>If you have something that you want to disallow search engine spiders access to, but sill allow unrestricted surfer access the best practice is to create a /robots.txt file in your root directory. That means the url to it should look like this:</p>
<p>http://www.yourwebsite.com/robots.txt</p>
<p>The following are examples of what should be contained in a /robots.txt file:</p>
<p>If you want to exclude ALL robots from your ENTIRE site:</p>
<p>User-agent: *<br />
Disallow: /<br />
If you want to exclude ALL robots from some content on your site:</p>
<p>User-agent: *<br />
Disallow: /cgi-bin/<br />
Disallow: /tmp/<br />
Disallow: /junk/<br />
If you want to disallow a single robot:</p>
<p>User-agent: EvilRobot<br />
Disallow: /<br />
IF you want to let ONE robot in:</p>
<p>User-agent: GoogleBot<br />
Disallow:<br />
User-agent: *<br />
Disallow: /<br />
The user-agent field is used to identify the bot you would like to allow/deny access. The * character is a wild card character and stands for “any”. Please note that if there is a part of your site that is private it should be password protected even if you use /robots.txt to disallow crawling of the page. /robots.txt is NOT enforceable and although legitimate spiders of search engines will follow its directives, not all bots are benign.</p>
<p>It’s also important to know that just because you have excluded content from being crawled, search engine results can still contain the urls to that content. This is because the spiders may have spotted links to your content from other sites with anchor text suggesting your uncrawled url may be good for that term. Below is a great video from <a href="http://www.mattcutts.com/blog/">Matt Cutts</a> showing how that works.</p>
<p><a href="http://www.mattcutts.com/blog/robots-txt-remove-url/">Uncrawled URLs in Search Results – Matt Cutts</a></p>
]]></content:encoded>
			<wfw:commentRss>http://erayne.com/blog/147/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>301 Redirects for SEO</title>
		<link>http://erayne.com/blog/301-redirects-seo/</link>
		<comments>http://erayne.com/blog/301-redirects-seo/#comments</comments>
		<pubDate>Tue, 08 Dec 2009 03:45:35 +0000</pubDate>
		<dc:creator>Bill Ludwig</dc:creator>
				<category><![CDATA[Our Blog]]></category>
		<category><![CDATA[.htaccess]]></category>
		<category><![CDATA[Best Practices]]></category>
		<category><![CDATA[Marketing]]></category>
		<category><![CDATA[SEO]]></category>
		<category><![CDATA[Tutorials]]></category>

		<guid isPermaLink="false">http://erayne.com/?p=15</guid>
		<description><![CDATA[One of the main problems that arise when doing SEO, updating content, or when redesigning a site, are URL’s that no longer work. When this happens not only are your customers subjected to broken links, but all search engine value the old page had is lost. Best practice is to take those old URL’s, and [...]]]></description>
			<content:encoded><![CDATA[<p>One of the main problems that arise when doing SEO, updating content, or when redesigning a site, are URL’s that no longer work. When this happens not only are your customers subjected to broken links, but all search engine value the old page had is lost.</p>
<p>Best practice is to take those old URL’s, and all their traffic and inbound links, and redirect them to a new working URL. This results in a win/win for your users and your search engine rankings.</p>
<p>That is where the 301 server redirection comes in. Without getting technical a 301 redirect is a HTTP response from the server that means that the page has permanently moved to a new URL. Setting up 301 redirection is pretty straight forward and easy to manage.</p>
<h3>301 Redirects and SEO</h3>
<p>The permenant redirection indicated by a 301 error is honored by search engines. What this means is that when a search engine encounters a 301 redirect while crawling the web they will delist the old broken URL and update all the incomming links to the new URL.</p>
<p>Here is a short video by Mike Cutts from Google explaining how Google treats 301 redirects and anchor text.</p>
<h3>301 Redirects Best Practices</h3>
<p>Currently what we suggest to all our clients is that any page that gets search engine traffic or that has a significant number of inbound links get a 301 redirect to a working page with similar or the same content. This provides the best experience for surfers and maximizes the SEO value of their existing inbound links.</p>
<p>Any old pages that don’t get traffic or that have little to no inbound links should then be sent to a well designed and helpful 404 error page. Since there is little value in these pages to be lost the purpose of this method is to maximize the user experience for those who stumble upon one of these pages.</p>
<p>Another popular alternative is to redirect these lesser used pages to the home page. Doing this allows you to direct the inbound links and search engine rankings to a valuable page on the website.</p>
<p>The downside of this method is that potential customers may follow a link expecting to find something and instead find themselves on your home page. This isn’t the best user experience and may result in a lost customer.</p>
<h3>Redirects on Apache Servers</h3>
<p>The simplist and most common ways to set up 301 redirects is through the <a href="http://httpd.apache.org/docs/2.0/howto/htaccess.html">.htaccess file</a> in the home directory of your web server. Unlike javascript redirects .htaccess redirects are invisible to the surfer because they are executed by the server when the page is delivered.</p>
<h3>.htaccess Single Page Redirects</h3>
<p>Single page redirects are best used when the content on the old page is still available at a new location. This way not only do you maintain all the inbound links and pagerank but visitors who arrive through external links find the information they were expecting.</p>
<p>When writing the redirect rule for a single page redirect you leave off the http://www and instead just include the path that would appear afterwords. For exampe to redirect from the old page of http://www.site.com/old/page.html to http://www.site.com/new/page.html you would use …</p>
<p>Redirect 301 old/page.html http://www.site.com/new/page.html</p>
<p>Other 301 Redirects</p>
<p>There are many other ways to set up 301 redirects besides using an .htaccess file. <a href="http://www.stevenhargrove.com/redirect-web-pages/">Steven Hargrove</a> has a great resource that covers all of them here. If possible I highly recommend using .htaccess though because of how easy it is to set up and maintain.</p>
]]></content:encoded>
			<wfw:commentRss>http://erayne.com/blog/301-redirects-seo/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

