<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>SEO on Tap &#187; SEO</title>
	<atom:link href="http://seoontap.com/cat/seo/feed" rel="self" type="application/rss+xml" />
	<link>http://seoontap.com</link>
	<description></description>
	<lastBuildDate>Wed, 19 Oct 2011 23:10:05 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
<xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" />
		<item>
		<title>Highlights from the Leaked 2011 Google Rating Guidelines</title>
		<link>http://seoontap.com/seo/google-quality-rater-document-2011.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=google-quality-rater-document-2011</link>
		<comments>http://seoontap.com/seo/google-quality-rater-document-2011.html#comments</comments>
		<pubDate>Wed, 19 Oct 2011 22:53:34 +0000</pubDate>
		<dc:creator>Damon</dc:creator>
				<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://seoontap.com/?p=423</guid>
		<description><![CDATA[<p>The Google Rating Guidelines first leaked by Pot Pie Girl are absolutely worth a half day of reading for any SEO. The document details how manual raters rate sites. It is important to remember that these are what manual raters are asked to look for. Machine learning algorithms like Panda are completely separate, but both [...]</p><p><a href="http://seoontap.com/seo/google-quality-rater-document-2011.html">Highlights from the Leaked 2011 Google Rating Guidelines</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></description>
			<content:encoded><![CDATA[<p>The Google Rating Guidelines first leaked by <a href="http://www.potpiegirl.com/" title="Pot Pie Girl">Pot Pie Girl</a> are absolutely worth a half day of reading for any SEO. The document details how manual raters rate sites.</p>
<p>It is important to remember that these are what manual raters are asked to look for. Machine learning algorithms like Panda are completely separate, but both manual raters and the Panda classifier are probably working to roughly the same goals.</p>
<p>Raters are given a query and a location and are then asked to rate pages for that particular query and location based on the presumed user intent for a particular query. Each sites gets one of the following ratings: Vital, Useful, Relevant, Slightly Relevant, Off-Topic or Useless, and Unrateable.</p>
<p>Sites can also be flagged as Spam, Pornography and Malicious.</p>
<p>Raters are expected to evaluate local intent based on the query location and query itself.</p>
<p>Based on screenshots within the document, a number of raters will rate a particular query/page and they are supported by administrators and moderators.</p>
<p>Here are some highlights.</p>
<h3>Six Months From Now, Lists Will Be a Spam Signal</h3>
<p>The document identifies a class of queries called &#8220;queries that ask for a list.&#8221; So go spam it to death.</p>
<blockquote><p>After typing a query, the search engine user sees a result page.  You can think of the results on the result page as a list.  Sometimes, the best results for “queries that ask for a list” are the best individual examples from that list.The page of search results itself is a nice list for users.</p>
<p>A landing page that provides links to many good individual results can also be very helpful to users.</p>
<p>“Queries that ask for a list” may be typed in singular or plural form.  For example, the query may be [bank], English (US) or [banks], English (US)</p>
</blockquote>
<p>The document lists credit cards, banks, bikes, airlines hotels and London Boutiques as examples of queries that ask for a list.</p>
<p>So now everyone will be running to create lists for this sort of query which will quickly make lists a negative quality signal forcing Google to focus on providing the list in the search results page rather than ranking pages with comprehensive lists.</p>
<h3>The Relationship between Ratings and Spam</h3>
<blockquote><p>Spam flags do not depend on a relationship between the query and the landing page.  A page should get a Spam flag if it is created using deceptive techniques &#8211; no matter what the query is or how helpful the page might be&#8230;</p>
<p>In some specific cases, it is also possible for a page to receive a Vital rating, and also be assigned a Spam flag.</p>
</blockquote>
<p>This isn&#8217;t really a big surprise, but it shows how big sites like JC Penny and BMW can spam Google and get away with it because they also get Vital or Relevant ratings for a lot of queries.</p>
<h3>Keyword Stuffing</h3>
<p>Not that I&#8217;ve ever felt the need to stuff a page full of keywords, but for clients that need some convincing&#8230;</p>
<blockquote><p>&#8220;We ask you to assign a  Spam flag if you think the number of keywords on the page is excessive and would be annoying and distracting to the real user.&#8221; p. 99</p>
</blockquote>
<p>and, somewhat more interestingly</p>
<blockquote><p>&#8220;URLs may also contain keyword stuffing. These URLs are computer-generated based on the words in the query and are often formatted with many hyphens (dashes) in them.  They are a strong spam signal.&#8221; p. 99</p>
</blockquote>
<h3>Sneaky Redirects</h3>
<p>Redirects to domains under different registrants is a spam signal.</p>
<p>If you are changing domain names, make sure you use the same registrant for both domain names. This is a particular danger for freelancers who start under their own name and then create a proper business entity or businesses that change names.</p>
<h3>What is Spam</h3>
<p>Most of the answers shouldn&#8217;t come as a big surprise, but I thought the below technical definitions to be quite instructive.</p>
<p>Feed-driven sites with PPC ads are a no-no.</p>
<blockquote><p>&#8220;A page that just contains freely available feeds and PPC ads, and was created just to make money, is spam.&#8221; p. 102</p>
</blockquote>
<p>And raters are asked to watch for template-driven sites that use keyword suggestion tools to generate related pages.</p>
<blockquote><p>Some websites use templates to mass-reproduce webpages automatically.  The content is usually copied from sources that provide such content.  You will learn to recognize templates, which usually follow a generic format or pattern.  Look for slight keyword variations that suggest automated use of a keyword suggestion tool. p. 103</p>
</blockquote>
<h3> Not all Affiliates are Thin</h3>
<p>I know a lot of people are struggling with Google&#8217;s definition of &#8220;Thin Affiliate.&#8221; So, straight from the behemoth&#8217;s mouth&#8230;</p>
<blockquote><p>Some affiliates are created to help users.  Anyone can become an “affiliate” of merchant sites such as Amazon and link to Amazon products.  Webmasters may do this to show products they like or to help users find a good deal.</p>
<p>For example, if the affiliate offers price comparison functionality, or displays product reviews, recipes, lyrics, etc., it is usually  not a thin affiliate, and, therefore, not  spam. Some  websites that offer price comparisons or other helpful shopping features, in addition to the affiliate link p. 106.</p>
</blockquote>
<p>The lesson here is to be a good website first and an affiliate second. I know, it&#8217;s not a great revelation, but it&#8217;s worth repeating.</p>
<h3>Unique Content is King, Or at Least it is not Spam</h3>
<blockquote><p>Some webpages with content are created just for the purpose of putting ads on them; writers are paid by spammers to create articles on a wide range of topics. Often the articles are very generic and don’t provide a lot of good information, but they are original. You won’t find the articles on another website. Although you may be convinced that the intent is to deceive, if the content makes sense and appears to be original, you will not be able to assign a Spam flag to such pages. You will have to use your judgment. p. 107</p>
</blockquote>
<p>Of course, having unique low quality content can still get you a low rating, you just won&#8217;t get a spam flag.</p>
<h3>Phishing&#8800;Malicious</h3>
<p>Ooh boy, I&#8217;m not sure how I feel about this. I know it will make some people very angry. But I don&#8217;t often see Phishing sites in Google, so should I care about what they tell manual raters?</p>
<blockquote><p>This landing page should make users (and raters) very suspicious and cautious.  The spelling and grammar are bad and unprofessional, and the page feels “spammy”.  What is most worrisome is that the page asks for the user’s bank password and pin number!</p>
<p>Even though we would not want to interact with the page, this type of phishing does not go against the Webspam Guidelines and the page should not be flagged as spam or malicious. p. 108</p>
</blockquote>
<h3>Conclusion, as in Google&#8217;s Conclusion, Not my Own</h3>
<blockquote><p>Remember to look at the page as a whole.  Spam pages usually have some of these characteristics:</p>
<ul>
<li>PPC ads are usually very prominent on the page, and it is obvious that the page was created for them.</li>
<li>If you do a text search, you will find that the content has been copied.</li>
<li>If you visually remove all of the spam elements from the page (PPC ads and copied content), there is nothing of any value remaining</li>
</ul>
<p>Good pages usually have these characteristics:</p>
<ul>
<li>The page is well-organized.  There may be ads on the page, but they are well identified and not distracting.</li>
<li>If you do a text search, the original page is usually the first result displayed.</li>
<li>The page will have value to the user.  A good search engine would want the page in a set of search results</li>
</ul>
</blockquote>
<p>I laughed at the good pages have ads that are not distracting part. So basically every single newspaper site in the world is a bad site for those terrible flyover ads.</p>
<p>I found this document on another site, it is not available at that site anymore. If you are looking for the document, I recommend you take an excerpt from this post and search for it in quotes in either Bing or, if you love irony, Google. <strong>Please don&#8217;t ask me for the document.</strong></p>
<p>Photo Credit: <a href="http://www.flickr.com/photos/brent_nashville/143855978/" title="Brent Moore on Flickr">Brent Moore</a> via Flickr</p>
<p><a href="http://seoontap.com/seo/google-quality-rater-document-2011.html">Highlights from the Leaked 2011 Google Rating Guidelines</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></content:encoded>
			<wfw:commentRss>http://seoontap.com/seo/google-quality-rater-document-2011.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>More on How Google Plus Changes SEO</title>
		<link>http://seoontap.com/seo/more-on-how-google-plus-changes-seo.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=more-on-how-google-plus-changes-seo</link>
		<comments>http://seoontap.com/seo/more-on-how-google-plus-changes-seo.html#comments</comments>
		<pubDate>Sat, 09 Jul 2011 00:07:24 +0000</pubDate>
		<dc:creator>Damon</dc:creator>
				<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://seoontap.com/?p=402</guid>
		<description><![CDATA[<p>Brian Chappell has an excellent post on 7 Google Plus Data Points That Could Change SEO. In a sea of repetitive reviews, it is the first intelligent thought on how Google+ could change SEO that I&#8217;ve seen. I&#8217;ve also been thinking about how Google+ changes SEO and wanted to expand on Chappell&#8217;s post with a [...]</p><p><a href="http://seoontap.com/seo/more-on-how-google-plus-changes-seo.html">More on How Google Plus Changes SEO</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></description>
			<content:encoded><![CDATA[<p>Brian Chappell has an excellent post on <a title="7 Google Plus Data Points That Could Change SEO" href="http://www.ignitesocialmedia.com/seo/google-plus-seo/">7 Google Plus Data Points That Could Change SEO</a>. In a sea of repetitive reviews, it is the first intelligent thought on how Google+ could change SEO that I&#8217;ve seen.</p>
<p>I&#8217;ve also been thinking about how Google+ changes SEO and wanted to expand on Chappell&#8217;s post with a few thoughts of my own.</p>
<h2>Profiling SEOs is Simple Thanks to Google+</h2>
<p>It&#8217;s not news that <a title="Google profiles SEOs" href="http://outspokenmedia.com/seo/google-profiles-seo-as-criminals/">Google profiles SEOs</a> and puts their projects under enhanced scrutiny. Google tries to discount viral link building by SEOs while allowing viral links by non-SEOs.</p>
<p>Well, now Google has it really easy. Even if you avoid self-identifying as an SEO in your online presence, if you are on Google+ and a friend puts you in their SEO circle, then you&#8217;ve got your own three scarlet letters.</p>
<p>If I were half as smart as I think I am, I&#8217;d have a Cafe Press link here selling “Friends don&#8217;t add friends to their Google+ SEO circle” t-shirts and mugs.</p>
<h2>Google Gets to Find Subject Matter Experts</h2>
<p>How does an SEO expert look on Google+? They probably get put in a high proportion of the Circles labeled SEO.</p>
<p>Now when one of these experts uses a +1 button, tweets or links to another document with a related keyword, Google would be well-advised to give that document some more weight.</p>
<p>If I were a betting man, and I am so pony up, Google would give expert opinions extra weight in image and video search until they can do a better job of indexing audio and non-text content.</p>
<p>Of course, if you want to be identified as an SEO expert, you have to contradict my previous advice and identify yourself as an SEO. As an SEO, if you described your relationship to Google on Facebook, it would definitely be “it&#8217;s complicated.”</p>
<h2>Profiling Authors Becomes Easier</h2>
<p>Google has already shown enough interest in giving content creators reputation scores and ranking documents that they have already <a title="published a patent on ranking authors" href="http://searchengineland.com/googles-agent-rank-patent-application-10487">published a patent on ranking authors</a>.</p>
<p>Let Circles identify the subject matter experts, then combine this information with the agent rank patent and the recent Google-Bing schema.org agreement and you can confidently rank and boost the best authors.</p>
<p>The only real obstacle I see is spammers using schema.org markup to identify their documents as being written by better known authors in the subject area. If you see Google encouraging you to link your own content across domains from you Google + profile, then you know they are struggling with it.</p>
<h2>More Anti-Spam</h2>
<p>Chappell talks about creating networks of social media accounts to artificially inflate social signals, but there is more to anti-spam than just that.</p>
<p>Setting up multiple sites to manipulate search results also gets a lot more difficult if trusted sites are expected to be owned by a well-networked Google+ account while not associated with the other spam sites.</p>
<p>Google is a long way away from making a Google+ account mandatory because they can&#8217;t be dropping small businesses with a minimal web-presence, but if a site smells funny and isn&#8217;t owned by somewhat active Google+ account, then there&#8217;s a good chance that it is spam.</p>
<p>In fact, I fully expect the next generation of Black Hat SEO to include third-party social media services that employ hundreds of third-world social media account owners to Tweet, +1 and Like so that black/grey sites can benefit from social media signals without being tied to a narrow set of IP addresses or end URLs.</p>
<p>Let me know how you think Google Plus changes SEO in the comments.</p>
<p><a href="http://seoontap.com/seo/more-on-how-google-plus-changes-seo.html">More on How Google Plus Changes SEO</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></content:encoded>
			<wfw:commentRss>http://seoontap.com/seo/more-on-how-google-plus-changes-seo.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Query Forensics: Choosing the Right Synonyms for Long-Tail Key Phrases</title>
		<link>http://seoontap.com/seo/query-forensics-choosing-the-right-synonyms-for-long-tail-key-phrases.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=query-forensics-choosing-the-right-synonyms-for-long-tail-key-phrases</link>
		<comments>http://seoontap.com/seo/query-forensics-choosing-the-right-synonyms-for-long-tail-key-phrases.html#comments</comments>
		<pubDate>Thu, 07 Jul 2011 21:46:30 +0000</pubDate>
		<dc:creator>Damon</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[Query Forensics]]></category>

		<guid isPermaLink="false">http://seoontap.com/?p=389</guid>
		<description><![CDATA[<p>In the previous query forensics post on how Google handles synonyms, we discovered that a page can rank with a synonym without even appearing in the text under the right conditions. The great news is that you can see in Google which queries are returning synonyms and use this information to find synonyms that can [...]</p><p><a href="http://seoontap.com/seo/query-forensics-choosing-the-right-synonyms-for-long-tail-key-phrases.html">Query Forensics: Choosing the Right Synonyms for Long-Tail Key Phrases</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></description>
			<content:encoded><![CDATA[<p>In the previous query forensics post on <a title="Query Forensics: Beginning Look at Synonyms in Google" href="http://seoontap.com/seo/query-forensics-beginning-synonyms-google.html">how Google handles synonyms</a>, we discovered that a page can rank with a synonym without even appearing in the text under the right conditions.</p>
<p>The great news is that you can see in Google which queries are returning synonyms and use this information to find synonyms that can rank you for the maximum number of key phrases.</p>
<p>Have a look at these results for “configure remote desktop” in Google.</p>
<p><a href="http://seoontap.com/wp-content/uploads/2011/07/google-serps-configure-remote-desktop.gif" rel="lightbox[389]"><img class="aligncenter size-medium wp-image-394" title="Google SERPs: Configure Remote Desktop" src="http://seoontap.com/wp-content/uploads/2011/07/google-serps-configure-remote-desktop-300x282.gif" alt="google serps configure remote desktop 300x282 Query Forensics: Choosing the Right Synonyms for Long Tail Key Phrases" width="300" height="282" /></a></p>
<p>The first thing to look at is the number of results returned (labeled &#8220;1&#8243; in the image). If Google returns results in the millions for a long-tail search term, there is a good chance you are looking at search results that have been expanded through synonyms.</p>
<p>The next thing to do is to look for highlighted synonyms (labeled &#8220;2&#8243; in the image) in the SERPs. While the example just goes to page 1, you will sometimes need to go deeper than the first page to find query expansion via synonyms.</p>
<p>For this particular query, “enable,” “setup,” and “set up” are all synonyms that Google is using. Because Google is checking for synonyms and it is returning millions of results, this query is being expanded with synonyms. For more competitive queries, you won&#8217;t see any query expansion.</p>
<p>The optional third step is to check the Google cache of these pages to see if the synonym candidate appears anywhere on the page. The reason we check the Google cache rather than the actual page is that if a page is ranking for a term because of backlinks, then Google shows a message at the top of the cache page.</p>
<p><a href="http://seoontap.com/wp-content/uploads/2011/07/google-cache-message.gif" rel="lightbox[389]"><img class="aligncenter size-full wp-image-395" title="Google Cache Message" src="http://seoontap.com/wp-content/uploads/2011/07/google-cache-message.gif" alt="google cache message Query Forensics: Choosing the Right Synonyms for Long Tail Key Phrases" width="706" height="31" /></a></p>
<p>In this particular example, the third document is ranking for configure because of anchor text from links pointing to the page. Configure does not appear anywhere on the page hosting the YouTube video and a number of the results in the bottom of page 1 only have the term way down in the text.</p>
<p>All results in this set either have configure on the page somewhere or in links pointing to the page except for the video search results which would suggest that synonyms are even more powerful when optimizing for video.</p>
<p>The next step is to swap out the synonyms in the query to see if we can find a key phrase that is getting enough volume not to need query expansion. What you are looking for is key phrases that are not being expanded by synonyms.</p>
<table>
<tbody>
<tr>
<th width="50%">Key Phrase</th>
<th width="30%">Documents</th>
<th width="20%">Query Expansion</th>
</tr>
<tr>
<td>enable remote desktop</td>
<td>4,670,000</td>
<td>yes</td>
</tr>
<tr>
<td>set up remote desktop</td>
<td>14,500,000</td>
<td>no</td>
</tr>
<tr>
<td>setup remote desktop</td>
<td>6,120,000</td>
<td>yes</td>
</tr>
</tbody>
</table>
<p>Based on this data, set up remote desktop is the most competitive term as it returns the highest number of pages without query expansion. While it would be advisable to have configure, setup and enable appear on the page, you can rank for these terms with links.</p>
<p>One thing to note, “set up,” “setting up” and “setup” are all highlighted in the “set up remote desktop” and “setup remote desktop” data sets. This is not query expansion via synonyms, but rather an example of query expansion via stemming.</p>
<p>There are few interesting points with this data set. The first is that in “set up remote desktop” SERPs, &#8220;set&#8221; always appears adjacent to &#8220;up&#8221; in some form or another.</p>
<p>This can&#8217;t always be the case because if they were truly equal, then they would return roughly the same number of documents, but “set up” returns 14,500,000 while “setup” returns less than half the number documents. The reason is likely that documents where “set” and “up” appear separately are also included in the “set up remote desktop” count; however, Google doesn&#8217;t show any results past 1000, so it is impossible to check with this query.</p>
<p>One really interesting query that shows how awesome Google is is “set remote desktop up” which properly stems with “set up,” “setting up” and “setup.”</p>
<p>As SEOs, we should note that “set remote desktop up” doesn&#8217;t appear as an exact match anywhere in the search results, or at least not in the top dozen or so pages. So not only is optimizing for mis-spelled words no longer effective, but optimizing for bad grammar doesn&#8217;t work anymore either.</p>
<p>In the previous Query Forensics post on synonyms, the data used the same number of keywords and better illustrates the pattern of higher-volume key phrases returning fewer results because of the lower-volume key phrases getting expanded through synonyms.</p>
<table>
<tbody>
<tr>
<th width="50%">Key Phrase</th>
<th width="30%">Documents</th>
<th width="20%">Query Expansion</th>
</tr>
<tr>
<td>cycling trails netherlands</td>
<td>60,300,000</td>
<td>yes</td>
</tr>
<tr>
<td>cycling routes netherlands</td>
<td>136,000</td>
<td>no</td>
</tr>
<tr>
<td>cycling paths netherlands</td>
<td>5,310,000</td>
<td>yes</td>
</tr>
</tbody>
</table>
<p>In this data set, &#8220;cycling routes netherlands&#8221; gets the highest volume of searches, but it returns the fewest documents. So the best target in this set is &#8220;cycling routes netherlands&#8221; while trails and paths can either be secondary on-page targets or in backlinks.</p>
<h2>Key Takeaways</h2>
<p>The technique doesn&#8217;t work for every query and in many cases I can&#8217;t figure out why one query returns more documents than another. However, for sufficiently long-tail and low-volume queries, <strong>the highest-volume key phrase is usually the one that returns the most documents while not showing signs of query expansion via synonyms</strong>.</p>
<p>Google can detect and fix poorly formed grammar and <strong>pages with poorly formed grammar exactly matching the searcher&#8217;s poorly formed search will not be returned</strong>.</p>
<p><strong>Pages can rank via synonyms if the SERPs page is showing query expansion via synonyms</strong>, but you will probably need links with that synonym. It is still better to have the synonym present on the page.</p>
<p>Image Credit: <a title="Image via Flickr" href="http://www.flickr.com/photos/gorbould/3305670476/sizes/l/in/photostream/">gorbould</a></p>
<p><a href="http://seoontap.com/seo/query-forensics-choosing-the-right-synonyms-for-long-tail-key-phrases.html">Query Forensics: Choosing the Right Synonyms for Long-Tail Key Phrases</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></content:encoded>
			<wfw:commentRss>http://seoontap.com/seo/query-forensics-choosing-the-right-synonyms-for-long-tail-key-phrases.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Query Forensics: Beginning Look at Synonyms in Google</title>
		<link>http://seoontap.com/seo/query-forensics-beginning-synonyms-google.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=query-forensics-beginning-synonyms-google</link>
		<comments>http://seoontap.com/seo/query-forensics-beginning-synonyms-google.html#comments</comments>
		<pubDate>Wed, 29 Jun 2011 21:20:06 +0000</pubDate>
		<dc:creator>Damon</dc:creator>
				<category><![CDATA[SEO]]></category>

		<guid isPermaLink="false">http://seoontap.com/?p=355</guid>
		<description><![CDATA[<p>Google uses synonyms for query expansion in most if not all queries. As an SEO, understanding when and how synonym-based query expansion works in Google is a fundamental yet poorly understood skill for any decent SEO. In this first instalment of my query forensics series, we look at a number of related queries and draw [...]</p><p><a href="http://seoontap.com/seo/query-forensics-beginning-synonyms-google.html">Query Forensics: Beginning Look at Synonyms in Google</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></description>
			<content:encoded><![CDATA[<p>Google uses synonyms for query expansion in most if not all queries. As an SEO, understanding when and how synonym-based query expansion works in Google is a fundamental yet poorly understood skill for any decent SEO. In this first instalment of my query forensics series, we look at a number of related queries and draw some beginning theories on the role synonyms play in formulating search results.</p>
<p>When Google indexes web pages, it stores both the URLs of pages that contain a particular word and keyword position information. To better understand how exact-match terms and synonyms affect SERPs, we&#8217;ll map their positions in the target document similar to how Google might index the document using the following notation: T+ = in title; T- = stem in title; Ts- = synonym in title; B+ = prominent in the body; B- = present in the body; Bs = absent from body, but a synonym is present; url = in URL; A = in backlink anchor text (when not otherwise present on page). PA and DA stand for Page Authority and Domain Authority via SEOmoz.</p>
<p>Google could certainly store more information, for instance separating the keyword in the domain from the keyword URL path or mapping the keyword in hx tags  and strong tags, but I don&#8217;t want this to get any more complex than it already is. Also, Google has only stated that they store information about where the term appears on a page in its posting lists, query expansion through synonyms could occur separately from the posting list that stores exact-match keyword data.</p>
<p>The first query we look at is <strong>Cycling Trails Netherlands</strong> (guess what I&#8217;m planning on doing for my holidays).</p>
<table>
<tbody>
<tr>
<th width="50%">Page</th>
<th colspan="3" width="30%">Search Terms</th>
<th colspan="2" width="20%">External Data</th>
</tr>
<tr>
<th></th>
<th>Cycling</th>
<th>Trails</th>
<th>Netherlands</th>
<th>PA</th>
<th>DA</th>
</tr>
<tr>
<td>http://www.cycletourer.co.uk/cycletouring/holland.shtml</td>
<td>T+;B+;URL</td>
<td>B-</td>
<td>Ts;B-</td>
<td>39</td>
<td>43</td>
</tr>
<tr>
<td>http://holland.cyclingaroundtheworld.nl/Wheretogo/WhereToGo-LongDistance.html</td>
<td>T+;B-;URL</td>
<td>Ts-;Bs</td>
<td>T+;Bs</td>
<td>32</td>
<td>42</td>
</tr>
<tr>
<td>http://holland.cyclingaroundtheworld.nl/Five-reasons.html</td>
<td>T+;B+</td>
<td>Bs</td>
<td>T+;Bs</td>
<td>28</td>
<td>42</td>
</tr>
<tr>
<td>http://www.bikely.com/listpaths/country/175</td>
<td>Ts;Bs;A</td>
<td>Ts;B+</td>
<td>T+;B+</td>
<td>46</td>
<td>70</td>
</tr>
<tr>
<td>http://en.wikipedia.org/wiki/Cycling_in_the_Netherlands</td>
<td>T+;B+;URL</td>
<td>Bs</td>
<td>T+;B+;URL</td>
<td>60</td>
<td>97</td>
</tr>
<tr>
<td>http://www.hembrow.eu/cycling/photos.html</td>
<td>T+;B+;URL</td>
<td>Bs</td>
<td>T+;B+</td>
<td>38</td>
<td>36</td>
</tr>
<tr>
<td>http://www.holland.com/uk/discoverholland/active/cycling/</td>
<td>T+;B+;URL</td>
<td>Ts;Bs</td>
<td>Ts;B-</td>
<td>47</td>
<td>75</td>
</tr>
<tr>
<td>http://en.wikiloc.com/trails/cycling/netherlands</td>
<td>T+;B+;URL</td>
<td>T+;B+;URL</td>
<td>T+;B+;URL</td>
<td>1</td>
<td>58</td>
</tr>
<tr>
<td>http://www.omnimap.com/catalog/cycling/b-neth.htm</td>
<td>T+;B+;URL</td>
<td>Bs</td>
<td>T+;B+</td>
<td>32</td>
<td>54</td>
</tr>
<tr>
<td>http://hembrow.blogspot.com/</td>
<td>T-;B+</td>
<td>Bs</td>
<td>B-</td>
<td>44</td>
<td>78</td>
</tr>
</tbody>
</table>
<p>The terms Cycling and Netherlands are obviously very important while synonyms seem to carry as much weight as Trails.</p>
<p>The lone page that is hyper-optimised for this query (http://en.wikiloc.com/trails/cycling/netherlands) doesn&#8217;t do very well. I wouldn&#8217;t be surprised if this site were hit by Panda and I suggest you look at it as a guide for what not to do, even though it is not as bad as many similarly over-optimized resources.</p>
<p>Substituting Routes for Trails (in a different browser to avoid personalised results from previous clicks) gives <strong>Cycling Routes Netherlands</strong> and it returns the same top five resources from the initial query, albeit in a different order, and six out of ten resources from the initial query appear again.</p>
<table>
<tbody>
<tr>
<th width="50%">Page</th>
<th colspan="3" width="30%">Search Terms</th>
<th colspan="2" width="20%">External Data</th>
</tr>
<tr>
<th></th>
<th>Cycling</th>
<th>Routes</th>
<th>Netherlands</th>
<th>PA</th>
<th>DA</th>
</tr>
<tr>
<td>http://holland.cyclingaroundtheworld.nl/Wheretogo/WhereToGo-LongDistance.html</td>
<td>T+;B-;URL</td>
<td>T+;B+</td>
<td>T+;Bs</td>
<td>32</td>
<td>42</td>
</tr>
<tr>
<td>http://holland.cyclingaroundtheworld.nl/Five-reasons.html</td>
<td>T+;B+</td>
<td>B+</td>
<td>T+;Bs</td>
<td>28</td>
<td>42</td>
</tr>
<tr>
<td>http://www.cycletourer.co.uk/cycletouring/holland.shtmll</td>
<td>T+;B+;URL</td>
<td>B-</td>
<td>Ts;Bs;URL</td>
<td>39</td>
<td>43</td>
</tr>
<tr>
<td>http://en.wikipedia.org/wiki/Cycling_in_the_Netherlands</td>
<td>T+;B+;URL</td>
<td>Ts-;B-</td>
<td>T+;B+;URL</td>
<td>60</td>
<td>97</td>
</tr>
<tr>
<td>http://www.holland.com/uk/discoverholland/active/cycling/</td>
<td>T+;B+;URL</td>
<td>T+;B+</td>
<td>Ts;B-</td>
<td>47</td>
<td>75</td>
</tr>
<tr>
<td>http://www.sustrans.org.uk/what-we-do/national-cycle-network/long-distance-rides/england/north-sea-cycle-route</td>
<td>T-;B+;URL</td>
<td>T+;B+</td>
<td>Ts;B+</td>
<td>45</td>
<td>81</td>
</tr>
<tr>
<td>http://www.bikely.com/listpaths/country/175</td>
<td>Ts;Bs</td>
<td>Ts;Bs</td>
<td>T+;B+</td>
<td>46</td>
<td>70</td>
</tr>
<tr>
<td>http://www.hembrow.eu/cycling/photos.html</td>
<td>T+;B+;URL</td>
<td>B-</td>
<td>T+;B+</td>
<td>38</td>
<td>36</td>
</tr>
<tr>
<td>http://www.omnimap.com/catalog/cycling/b-neth.htm</td>
<td>T-;B+;URL</td>
<td>T+;B+;URL</td>
<td>B-</td>
<td>32</td>
<td>54</td>
</tr>
<tr>
<td>http://www.sustransshop.co.uk/products/5035-nederlandse-kustroute&#8212;north-sea-cycle-route-netherlands</td>
<td>T+;B+;URL</td>
<td>T+;B+;URL</td>
<td>T+;B+;URL</td>
<td>29</td>
<td>40</td>
</tr>
</tbody>
</table>
<p>&#8220;Cycling routes Netherlands&#8221; (1,300 global monthly searches via AdWords Keyword Tool) is a more popular query term than &#8220;cycling trails Netherlands&#8221; (not enough volume to display data). As a result, documents containing the term &#8220;routes&#8221; are a lot more prominent in the search results than &#8220;trails&#8221; in the first query.</p>
<p>What Google may be doing, is it is requesting the posting lists for each of &#8220;cycling,&#8221; &#8220;trails,&#8221; and &#8220;Netherlands;&#8221; the query processor sees that trails returns too few documents and then looks for synonyms to include more documents. If this is the case, documents included via synonyms are clearly not penalized for not containing the overly-restrictive keyword.</p>
<p>This doesn&#8217;t mean that synonyms are always treated equally, only when one keyword is overly restrictive.</p>
<p>The more common &#8220;cycling routes Netherlands&#8221; query only returns 129,000 documents. The less common &#8220;cycling trails Netherlands&#8221; query returns 61,700,000. What this shows is that query expansion only occurs when necessary and the query processor only adds synonyms when necessary (rather than adding them automatically, but not ranking them as well).</p>
<p>As an SEO, you should be checking query volume against the number of documents returned by Google when deciding whether to create a new page and comparing them against other synonymous key-phrases.</p>
<p>If a query has a low query volume but returns a lot of documents, then look for a synonym outperforming an exact-match term in the results and target that term instead.</p>
<p>Based on the higher number of documents returned by the lower volume query, query expansion via synonyms in Google likely works like in the diagram below.</p>
<div id="attachment_361" class="wp-caption aligncenter" style="width: 710px"><a href="http://seoontap.com/wp-content/uploads/2011/06/synonym-query-expansion.gif" rel="lightbox[355]"><img class="size-full wp-image-361" src="http://seoontap.com/wp-content/uploads/2011/06/synonym-query-expansion.gif" alt="synonym query expansion Query Forensics: Beginning Look at Synonyms in Google" width="700" height="927" title="Query Forensics: Beginning Look at Synonyms in Google" /></a><p class="wp-caption-text">Synonym Query Expansion in Google</p></div>
<p>The obvious next step is to try to figure out where the threshold for synonym-based query expansion lies and what rules govern it so that we can know when to recommend creating separate pages to target synonymous keywords, when to target multiple synonyms on the same page and what synonym to choose.</p>
<p>&nbsp;</p>
<p><a href="http://seoontap.com/seo/query-forensics-beginning-synonyms-google.html">Query Forensics: Beginning Look at Synonyms in Google</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></content:encoded>
			<wfw:commentRss>http://seoontap.com/seo/query-forensics-beginning-synonyms-google.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Wolfram Alpha and Yahoo!: Destined for Failure</title>
		<link>http://seoontap.com/seo/wolfram-alpha-and-yahoo-destined-for-failure.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=wolfram-alpha-and-yahoo-destined-for-failure</link>
		<comments>http://seoontap.com/seo/wolfram-alpha-and-yahoo-destined-for-failure.html#comments</comments>
		<pubDate>Tue, 19 May 2009 20:32:36 +0000</pubDate>
		<dc:creator>Damon</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[search]]></category>
		<category><![CDATA[Wolfram Alpha]]></category>
		<category><![CDATA[Yahoo!]]></category>

		<guid isPermaLink="false">http://www.damongudaitis.com/?p=305</guid>
		<description><![CDATA[<p>First, the release of Wolfram Alpha, a computational knowledge engine that crawls the web to calculate the answers to questions. And now Yahoo!&#8217;s announcement that they&#8217;re going to can the links and just throw out the answers because that&#8217;s what searchers want. Bold? yes. Interesting? sure. Successful? not a chance. The problem with both Wolfram [...]</p><p><a href="http://seoontap.com/seo/wolfram-alpha-and-yahoo-destined-for-failure.html">Wolfram Alpha and Yahoo!: Destined for Failure</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></description>
			<content:encoded><![CDATA[<p><img class="aligncenter size-full wp-image-306" src="http://www.damongudaitis.com/wp-content/uploads/2009/05/titanic.jpg" alt="titanic Wolfram Alpha and Yahoo!: Destined for Failure" width="580" height="387" title="Wolfram Alpha and Yahoo!: Destined for Failure" />First, the release of <a href="http://www.wolframalpha.com/">Wolfram Alpha</a>, a computational knowledge engine that crawls the web to calculate the answers to questions. And now Yahoo!&#8217;s announcement that they&#8217;re going to <a href="http://www.wired.com/epicenter/2009/05/yahoo-announces-new-model-for-search-liveblog/">can the links and just throw out the answers</a> because that&#8217;s what searchers want.</p>
<p>Bold? yes. Interesting? sure. Successful? not a chance.</p>
<p>The problem with both Wolfram Alpha and Yahoo?&#8217;s new direction is that if they suck, they&#8217;re going to fail, and if they rock, they&#8217;re going to fail.</p>
<p>Let&#8217;s pretend Wolfram Alpha improves to the point where it works well for most queries and Yahoo? is able to do the same. Let&#8217;s even extend the fantasy to the point where both search engines gain traction and a large market share.</p>
<p>What&#8217;s going to happen? Most content producers are going to lose the incentive to produce good content.</p>
<p>Search engines crawl the web getting free content, skirting the fringe of copyright by allowing webmasters to block them if they choose, and in exchange they send free traffic to the content producers who are able to make money by selling crap or showing ads for crap.</p>
<p>A successful Wolfram Alpha and Yahoo? won&#8217;t be sending the content producers free traffic, they&#8217;ll take the best content while preventing the people who create the content from profiting.</p>
<p>Dead-tree media companies already <a href="http://www.washingtonpost.com/wp-dyn/content/article/2009/05/15/AR2009051503000_pf.html">howl about Google</a> crawling their and making it accessible to people even though they get <a href="http://daggle.com/googles-love-for-newspapers-how-little-they-appreciate-it-443">preferential treatment</a>. Stop giving them their cut and get ready to see them unleash the dogs of law before finally going all dinosaur on the world.</p>
<p>While Wolfram Alpha and Yahoo? are fighting, and losing, legal challenges, there&#8217;s only one realistic outcome. Content producers are going to block Wolfram Alpha and Yahoo? from crawling their sites.</p>
<p>It may be killing a great and useful idea, but, if someone can <a href="http://en.wikipedia.org/wiki/Who_Killed_the_Electric_Car%3F">kill the electric car</a>, then killing these two seems rather trivial and profitable.</p>
<p>Photo Credit: <a href="http://www.flickr.com/photos/moore_photography/292789772/">Moore Fun</a></p>
<p><a href="http://seoontap.com/seo/wolfram-alpha-and-yahoo-destined-for-failure.html">Wolfram Alpha and Yahoo!: Destined for Failure</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></content:encoded>
			<wfw:commentRss>http://seoontap.com/seo/wolfram-alpha-and-yahoo-destined-for-failure.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Importance of Server Speed</title>
		<link>http://seoontap.com/seo/server-speed-importance.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=server-speed-importance</link>
		<comments>http://seoontap.com/seo/server-speed-importance.html#comments</comments>
		<pubDate>Tue, 17 Feb 2009 20:51:31 +0000</pubDate>
		<dc:creator>Damon</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[crawl rate]]></category>
		<category><![CDATA[server performance]]></category>

		<guid isPermaLink="false">http://www.damongudaitis.com/?p=181</guid>
		<description><![CDATA[<p>In case you didn&#8217;t know the importance of server speed, have a careful look at the time spent downloading a page and the crawl rate in this screenshot from Google Webmaster Tools. Google Crawl Rate</p><p><a href="http://seoontap.com/seo/server-speed-importance.html">The Importance of Server Speed</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></description>
			<content:encoded><![CDATA[<p>In case you didn&#8217;t know the importance of server speed, have a careful look at the time spent downloading a page and the crawl rate in this screenshot from Google Webmaster Tools.</p>
<p><img class="size-full wp-image-182" src="http://www.damongudaitis.com/wp-content/uploads/2009/02/google-crawl-rate.png" alt="google crawl rate The Importance of Server Speed" width="530" height="393" title="The Importance of Server Speed" />Google Crawl Rate</p>
<p><a href="http://seoontap.com/seo/server-speed-importance.html">The Importance of Server Speed</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></content:encoded>
			<wfw:commentRss>http://seoontap.com/seo/server-speed-importance.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>What&#8217;s Wrong with Penalty Disclosure?</title>
		<link>http://seoontap.com/seo/google-penalty-disclosure.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=google-penalty-disclosure</link>
		<comments>http://seoontap.com/seo/google-penalty-disclosure.html#comments</comments>
		<pubDate>Tue, 03 Feb 2009 17:06:34 +0000</pubDate>
		<dc:creator>Damon</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[HYVES]]></category>

		<guid isPermaLink="false">http://www.damongudaitis.com/?p=135</guid>
		<description><![CDATA[<p>It took Google about 1 day to react to the hyves subdomain disclosure. In case you can&#8217;t be bothered to click the above link, it seems that for a short while the G-spot was disclosing whether a domain was penalized in the SERPs when you entered hyves.domain.com. This is my third site and it has [...]</p><p><a href="http://seoontap.com/seo/google-penalty-disclosure.html">What&#8217;s Wrong with Penalty Disclosure?</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></description>
			<content:encoded><![CDATA[<p>It took Google about 1 day to react to the <a href="http://www.mediadonis.net/?p=378">hyves subdomain disclosure</a>.</p>
<p>In case you can&#8217;t be bothered to click the above link, it seems that for a short while the G-spot was disclosing whether a domain was penalized in the SERPs when you entered hyves.domain.com.</p>
<p>This is my third site and it has the lowest starting page rank of any site I&#8217;ve created: PR 0. I&#8217;m curious why.</p>
<p>Of course big G has plugged this evil leak so I don&#8217;t know if I&#8217;ve been penalized. But I&#8217;m left wondering if I&#8217;ve been penalized and wondering why.</p>
<p>I haven&#8217;t done anything to deserve a penalty. I don&#8217;t really believe that I&#8217;m suffering a penalty because Google sends me traffic for long-tail terms that I deserve to rank for and nothing for competitive terms that I didn&#8217;t expect.</p>
<h3>But why can&#8217;t I check anyway?</h3>
<p>I&#8217;d like to know if my early auto industry post which generated spammy, non-reciprocated, track-backs that soon disappeared signalled my new domain as spam.</p>
<p>I originally purchased my name as a domain about 2 years ago with Bluehost.</p>
<p>I then let the registration lapse and Network Solutions started spamming me telling me my-name.com was available.</p>
<p>I was forced to do business with Network Solutions when a domain I wanted was pre-sold through an exclusive domain affiliate deal.</p>
<p>Let&#8217;s just say that imo they are part of the problem, not the solution.</p>
<p>What I&#8217;d like to know is if I&#8217;ve been penalized because Bluehost or, more likely, Network Solutions put up a spammy parking page between when I first registered my name and now.</p>
<p>I&#8217;m not bitter at Google. They are sending traffic in volumes consistent with new sites like mine. And they obfuscate Page Rank so it&#8217;s possible there isn&#8217;t any real penalty.</p>
<p>Still, I&#8217;d like to know.</p>
<p><a href="http://seoontap.com/seo/google-penalty-disclosure.html">What&#8217;s Wrong with Penalty Disclosure?</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></content:encoded>
			<wfw:commentRss>http://seoontap.com/seo/google-penalty-disclosure.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Internet Bias in Search Results</title>
		<link>http://seoontap.com/seo/internet-bias-search-results.html?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=internet-bias-search-results</link>
		<comments>http://seoontap.com/seo/internet-bias-search-results.html#comments</comments>
		<pubDate>Tue, 30 Dec 2008 18:15:51 +0000</pubDate>
		<dc:creator>Damon</dc:creator>
				<category><![CDATA[SEO]]></category>
		<category><![CDATA[serps]]></category>

		<guid isPermaLink="false">http://www.damongudaitis.com/?p=52</guid>
		<description><![CDATA[<p>Everyone is writing about how Internet-based business will win even in the face of a recession. It seems like I get at least one article a day about it in my feed reader. The problem is that everyone whoe blogs about Internet business is invested in its success, myself included. I was looking for some [...]</p><p><a href="http://seoontap.com/seo/internet-bias-search-results.html">Internet Bias in Search Results</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></description>
			<content:encoded><![CDATA[<p>Everyone is writing about how Internet-based business will win even in the face of a recession. It seems like I get at least one article a day about it in my feed reader. The problem is that everyone whoe blogs about Internet business is invested in its success, myself included.</p>
<p>I was looking for some independent information eCommerce in a recession earlier today, so I googled <em>ecommerce recession</em>.</p>
<div id="attachment_53" class="wp-caption alignnone" style="width: 590px"><a href="http://www.damongudaitis.com/wp-content/uploads/2008/12/ecommerce-recession-serps.gif" rel="lightbox[52]"><img class="size-full wp-image-53" title="SERPs for eCommerce Recession" src="http://www.damongudaitis.com/wp-content/uploads/2008/12/ecommerce-recession-serps.gif" alt="ecommerce recession serps Internet Bias in Search Results" width="580" height="648" /></a><p class="wp-caption-text">eCommerce Recession SERPS</p></div>
<p>The first page of results consists mostly of yay eCommerce pages, but I give Google credit for the #3 comScore report on an eCommerce recession. Too bad the author doesn&#8217;t know the definition of recession (hint: 2 consecutive quarters of negative growth, not 1 month).</p>
<p>The second page is even more revealing. Most of the posts cite the same two reports, one by Forrester Research and the other by Sandford-Bernstein. Unfortunately I couldn&#8217;t find the Bernstein report, and I didn&#8217;t feel like registering to maybe find the Forrester report.</p>
<p>As long as search engines use links as a ranking factor, search results for Internet-related queries are going to be biased towards pages touting the benefits of eWhatever because Internet-savvy people are the ones doing the most writing and linking and have a vested interest in promoting content that reminds them of their greatness.</p>
<p><a href="http://seoontap.com/seo/internet-bias-search-results.html">Internet Bias in Search Results</a> by <a href="http://seoontap.com">SEO on Tap</a></p>]]></content:encoded>
			<wfw:commentRss>http://seoontap.com/seo/internet-bias-search-results.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using disk: basic
Page Caching using disk: enhanced (User agent is rejected)

Served from: seoontap.com @ 2012-02-04 17:34:06 -->
