<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Surf Canyon &#187; Discovery</title>
	<atom:link href="http://blog.surfcanyon.com/category/discovery/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.surfcanyon.com</link>
	<description>Unleash the Power of Search</description>
	<lastBuildDate>Wed, 25 Jan 2012 21:45:20 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.1</generator>
		<item>
		<title>Happy Birthday to Us!</title>
		<link>http://blog.surfcanyon.com/2009/02/19/happy-birthday-to-us/</link>
		<comments>http://blog.surfcanyon.com/2009/02/19/happy-birthday-to-us/#comments</comments>
		<pubDate>Thu, 19 Feb 2009 22:04:10 +0000</pubDate>
		<dc:creator>Surf Canyon</dc:creator>
				<category><![CDATA[- Top Posts -]]></category>
		<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Collaboration]]></category>
		<category><![CDATA[Discovery]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[Personalization]]></category>
		<category><![CDATA[Testimonials]]></category>

		<guid isPermaLink="false">http://blog.surfcanyon.com/2009/02/19/happy-birthday-to-us/</guid>
		<description><![CDATA[Surf Canyon has been around since April of 2006, however, it was one year ago today that we launched our application to some very flattering reviews. Since then, there have been a number of exciting achievements during our first year: Over 500 thousand people benefited from our search technology Over 80 million queries on Google, [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://blog.surfcanyon.com/wp-content/uploads/2009/02/birthday-cake-2.gif" title="Birthday Cake" alt="Birthday Cake" align="left" /><a href="http://www.surfcanyon.com" target="_blank">Surf Canyon</a> has been around since April of 2006, however, it was <a href="http://blog.surfcanyon.com/2008/02/19/surf-canyon-launches-discovery-engine-for-search/" target="_blank">one year ago today</a> that we launched our application to some <a href="http://blog.surfcanyon.com/2008/02/19/the-reviews-are-rave/" target="_blank">very flattering reviews</a>. Since then, there have been a number of exciting achievements during our first year:</p>
<ul>
<li>Over 500 thousand people benefited from our search technology</li>
<li>Over 80 million queries on Google, Yahoo!, Live Search and Craigslist were enhanced using our real-time re-ranking</li>
<li>Over 350 million search results were &#8220;recommended&#8221;</li>
<li>We shared the <a href="http://blog.surfcanyon.com/2009/01/13/googles-searchwiki-surf-canyon-share-the-mossberg-solutions-column/" target="_blank">Mossberg Solution</a> column in the WSJ with Google and were very well received</li>
<li><a href="http://blog.surfcanyon.com/2008/12/19/top-10-ases-for-2008/" target="_blank">ReadWriteWeb</a> selected us as one of the Top 100 Products and Top 10 Alternative Search Engines of 2008</li>
<li>Mozilla, the makers of Firefox, added us to their list of &#8220;<a href="http://blog.surfcanyon.com/2008/06/04/mozilla-recommends-surf-canyon/" target="_blank">Recommended</a>&#8221; add-ons</li>
</ul>
<p>As such, we are today officially removing the &#8220;Beta&#8221; label from our website and application.</p>
<p>The path for most every start-up is long and difficult and we continue to have a considerable amount of road before us. While we will inevitably face many new challenges, we&#8217;d like to take this opportunity to express our gratitude to some of the people who&#8217;ve helped us to get this far:</p>
<ul>
<li>Mozilla for facilitating the development of add-ons as well as providing tremendous feedback and invaluable assistance in <a href="https://addons.mozilla.org/en-US/firefox/addon/6549" target="_blank">distributing</a> the application</li>
<li>Microsoft for prominently including our application in the <a href="http://www.ieaddons.com/en/details/searchhelpers/Surf_Canyon/" target="_blank">IE Gallery</a> and for SilkRoad</li>
<li>Yahoo! for <a href="http://developer.yahoo.com/search/boss/" target="_blank">BOSS</a></li>
<li>All of the companies who&#8217;ve built amazing search engines upon which we build</li>
<li>Our <a href="http://blog.surfcanyon.com/2008/05/28/seed-funding-secured/" target="_blank">investors</a> for putting their confidence in our vision</li>
<li>All of the journalists and bloggers who took the time and made the effort to <a href="http://blog.surfcanyon.com/category/media/" target="_blank">evaluate our software</a></li>
<li>All of the people who&#8217;ve downloaded our application and the thousands of people who&#8217;ve generously offered their encouragement, feedback, <a href="http://blog.surfcanyon.com/category/testimonials/" target="_blank">praise</a>, criticisms, comments and advice</li>
</ul>
<p>While it&#8217;s fun to look back at our accomplishments, we&#8217;re now focused on the year ahead and are very excited about the future.</p>
]]></content:encoded>
			<wfw:commentRss>http://js-kit.com/rss/blog.surfcanyon.com/p=181</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>The Four Quadrants of Personalization</title>
		<link>http://blog.surfcanyon.com/2009/01/14/the-four-quadrants-of-personalization/</link>
		<comments>http://blog.surfcanyon.com/2009/01/14/the-four-quadrants-of-personalization/#comments</comments>
		<pubDate>Wed, 14 Jan 2009 18:48:56 +0000</pubDate>
		<dc:creator>Surf Canyon</dc:creator>
				<category><![CDATA[Discovery]]></category>
		<category><![CDATA[Personalization]]></category>
		<category><![CDATA[Research]]></category>

		<guid isPermaLink="false">http://blog.surfcanyon.com/2009/01/14/the-four-quadrants-of-personalization/</guid>
		<description><![CDATA[Yesterday, The Mossberg Solution Column of the Wall Street Journal ran an article about Google&#8217;s SearchWiki and Surf Canyon. While we&#8217;ve released an update of our software to make sure that these two technologies are compatible and have discussed how they compliment each other, it&#8217;s perhaps worth positioning these technologies in a larger framework of [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday, <a href="http://online.wsj.com/article/SB123189045689079109.html" target="_blank">The Mossberg Solution Column</a> of the Wall Street Journal ran an article about Google&#8217;s <a href="http://googleblog.blogspot.com/2008/11/searchwiki-make-search-your-own.html" target="_blank">SearchWiki</a> and Surf Canyon. While we&#8217;ve released an update of our software to make sure that these two technologies are <a href="http://blog.surfcanyon.com/2008/11/21/v116-searchwiki-compatibility/" target="_blank">compatible</a> and have discussed how they <a href="http://blog.surfcanyon.com/2009/01/13/googles-searchwiki-surf-canyon-share-the-mossberg-solutions-column/" target="_blank">compliment each other</a>, it&#8217;s perhaps worth positioning these technologies in a larger framework of search personalization.</p>
<p>Personalization is often been divided into two <a href="http://en.wikipedia.org/wiki/Personalization" target="_blank">categories</a>, implicit and explicit, the respective merits of which have been <a href="http://www.altsearchengines.com/2008/05/21/implicit-and-explicit-personalization-in-search/" target="_blank">debated</a>. Implicit personalization is based on preferences <em>inferred </em>from behavioral information. While this doesn&#8217;t require any effort on the part of the user, making accurate determinations of intent can be challenging. On the other hand, explicit personalization is driven by <em>direct </em>indication from the user. The intent is often clearer, however, the onus is on the user to make the effort to specify preferences.</p>
<p>Since April 2006, Surf Canyon has been looking at personalization from a different perspective: real-time vs. long-term. Real-time personalization alters the user experience <em>instantly </em>as behavioral signals are collected. While determining intent &#8220;on the fly&#8221; is challenging given the requirement for speed and the paucity of data, the signals are typically very strong. Long-term personalization, by contrast, relies on the accumulation of considerable user data over a <em>significant</em> amount of time. Determining the user&#8217;s &#8220;at the moment&#8221; intent can be difficult given how quickly the signals decay and how often users change context.</p>
<p>Nevertheless, as indicated here, all of these options are currently available to the internet searcher:</p>
<p><img src="http://blog.surfcanyon.com/wp-content/uploads/2009/01/personalization-quadrants.jpg" alt="Personalization Quadrants" /></p>
<p><strong>Surf Canyon v1</strong> &#8211; Our flagship product <a href="http://blog.surfcanyon.com/2008/02/19/surf-canyon-launches-discovery-engine-for-search/" target="_blank">introduced</a> real-time implicit personalization for search. By observing the actions of the user as the search is taking place, the application helps people find information by re-ranking the results instantly, effectively transforming the search page from a static list of links to a dynamic set of results that &#8220;work with&#8221; the user.</p>
<p><strong>Google Personalization</strong> &#8211; By observing the search pattern and click history of the user over an extended period of time, Google builds a profile of the user&#8217;s long-term interests which are then used to personalize the results for future searches. The <a href="http://blog.surfcanyon.com/2007/09/19/hold-the-pickles-hold-the-lettuce/" target="_blank">first post</a> on this blog analyzed some of the benefits and shortcomings of this technology.</p>
<p><strong>Surf Canyon v2</strong> &#8211; <a href="http://blog.surfcanyon.com/2008/12/09/v200-press-release/" target="_blank">Launched</a> in December &#8217;08, v2 of Surf Canyon introduced <a href="http://my.SurfCanyon.com" target="_blank">my.SurfCanyon.com</a>, enabling users, should they so desire, to explicitly indicate sources of content they prefer as well as those that they dislike. These long-term preferences are then taken into account to, once again, personalize search results to the user&#8217;s benefit.</p>
<p><strong>Google SearchWiki</strong> &#8211; <a href="http://lifehacker.com/5095464/google-launches-searchwiki-for-custom+ordered-search-results" target="_blank">Launched</a> in November &#8217;08, SeachWiki offers controls to enable users to manually manipulate search results. By clicking a button, results may immediately be &#8220;promoted&#8221; to the top of the search page or deleted altogether. The next time the user runs the same search, the user&#8217;s personal modifications will be displayed.</p>
<p>Different users will have varying appreciations for the costs and benefits associated with each of these technologies, however, they are all compatible and, to a large extent, compliment each other. Today&#8217;s internet searchers are therefore free to use all of them, some of them or none of them, as they prefer.</p>
<p>Update (1/15/09) &#8211; Charles Knight at <a href="http://www.altsearchengines.com/2009/01/15/the-four-quadrants-of-personalization/" target="_blank">AltSearchEngines</a> ran this post under their &#8220;Guest Authors&#8221; series.</p>
]]></content:encoded>
			<wfw:commentRss>http://js-kit.com/rss/blog.surfcanyon.com/p=170</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Google’s SearchWiki &amp; Surf Canyon Share The Mossberg Solution Column</title>
		<link>http://blog.surfcanyon.com/2009/01/13/googles-searchwiki-surf-canyon-share-the-mossberg-solutions-column/</link>
		<comments>http://blog.surfcanyon.com/2009/01/13/googles-searchwiki-surf-canyon-share-the-mossberg-solutions-column/#comments</comments>
		<pubDate>Wed, 14 Jan 2009 01:29:39 +0000</pubDate>
		<dc:creator>Surf Canyon</dc:creator>
				<category><![CDATA[- Top Posts -]]></category>
		<category><![CDATA[Discovery]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[Personalization]]></category>
		<category><![CDATA[Testimonials]]></category>

		<guid isPermaLink="false">http://blog.surfcanyon.com/2009/01/13/googles-searchwiki-surf-canyon-share-the-mossberg-solutions-column/</guid>
		<description><![CDATA[Katherine Boehret at the Wall Street Journal wrote this week&#8217;s The Mossberg Solution column, which will appear in tomorrow&#8217;s print edition, entitled &#8220;Web Searches That Really Bear Fruit.&#8221; The subject, two &#8220;new free tools [that] aim to make online results more relevant by tracking your reactions,&#8221; is Google&#8217;s SearchWiki and Surf Canyon. We are naturally [...]]]></description>
			<content:encoded><![CDATA[<p>Katherine Boehret at the Wall Street Journal wrote this week&#8217;s <a href="http://online.wsj.com/article/SB123189045689079109.html" target="_blank">The Mossberg Solution</a> column, which will appear in tomorrow&#8217;s print edition, entitled &#8220;<a href="http://solution.allthingsd.com/20090113/web-searches-that-really-bear-fruit/" target="_blank">Web Searches That Really Bear Fruit</a>.&#8221; The subject, two &#8220;new free tools [that] aim to make online results more relevant by tracking your reactions,&#8221; is Google&#8217;s <a href="http://googleblog.blogspot.com/2008/11/searchwiki-make-search-your-own.html" target="_blank">SearchWiki</a> and Surf Canyon.</p>
<p>We are naturally thrilled to be sharing a such prominent column with a company we esteem so highly. Here are some excerpts:</p>
<blockquote><p>&#8220;[SearchWiki and Surf Canyon] don’t necessarily compete against each other; in fact, they can be used in tandem. But after initially entering a search query, SearchWiki requires additional work on the part of the user that many people may not want to do. Surf Canyon works automatically as you go, sorting results according to real-time user behavior.</p>
<p>If you like the idea of more personalized Web searches but would like to use other search engines or don’t want to do extra work, you might like Surf Canyon.</p>
<p>Google’s SearchWiki is asking users to do extra work, which may not be practical for many users. But if you do use it, this tool’s personalized, saved results could be a real boon. Surf Canyon worked well for me with multiple search engines, retrieving data from result pages I likely wouldn’t have opened. Either way, your days of futile Web searching are numbered.&#8221;</p></blockquote>
<p>The real-time <em>implicit </em>personalization offered by Surf Canyon and the real-time <em>explicit</em> personalization offered by SearchWiki are both <a href="http://blog.surfcanyon.com/2008/11/21/v116-searchwiki-compatibility/" target="_blank">compatible</a> and complimentary. You don&#8217;t have to pick one or the other when you can have both!</p>
<p><img src="http://blog.surfcanyon.com/wp-content/uploads/2009/01/wsj-screen-shot-with-shadow.jpg" alt="Wall Street Journal Screen Shot with Shadow" /></p>
<p>Update (1/14/09) &#8211; Katherine Boehret added a video. Here is what she had to say at the 3m20s mark:</p>
<blockquote><p>&#8220;At the end of the day, Google SearchWiki makes you do a lot of work. It might be beneficial for some people who want to save their searches, but Surf Canyon works with you as you go, on the fly. It works automatically and doesn’t make you do anything extra other than what you’re already doing.&#8221;</p></blockquote>
<p><embed src="http://s.wsj.net/media/swf/main.swf" bgcolor="#FFFFFF" flashvars="videoGUID={86C72F50-978D-4B19-8892-D33A657F1131}&amp;playerid=1000&amp;plyMediaEnabled=1&amp;configURL=http://wsj.vo.llnwd.net/o28/players/&amp;autoStart=false” base=" name="flashPlayer" seamlesstabbing="false" type="application/x-shockwave-flash" swliveconnect="true" pluginspage="http://www.macromedia.com/shockwave/download/index.cgi?P1_Prod_Version=ShockwaveFlash" width="512" height="363"></embed></p>
<p>Update (2/18/09) &#8211; <a href="http://www.blinnpr.com/" target="_blank">Steven Blinn</a>, PR consultant, talks with David Ward at <a href="http://www.prweekus.com/WSJ-drives-Surf-Canyon-downloads/article/127540/" target="_blank">PRWeek</a> about getting coverage for Surf Canyon in the Wall Street Journal.</p>
]]></content:encoded>
			<wfw:commentRss>http://js-kit.com/rss/blog.surfcanyon.com/p=166</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Evaluating Surf Canyon’s Technology (Part 2)</title>
		<link>http://blog.surfcanyon.com/2008/12/01/evaluating-surf-canyons-technology-part-2/</link>
		<comments>http://blog.surfcanyon.com/2008/12/01/evaluating-surf-canyons-technology-part-2/#comments</comments>
		<pubDate>Mon, 01 Dec 2008 16:44:57 +0000</pubDate>
		<dc:creator>Surf Canyon</dc:creator>
				<category><![CDATA[- Top Posts -]]></category>
		<category><![CDATA[Discovery]]></category>
		<category><![CDATA[Personalization]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Tutorials]]></category>

		<guid isPermaLink="false">http://blog.surfcanyon.com/2008/12/01/evaluating-surf-canyon%e2%80%99s-technology-part-2/</guid>
		<description><![CDATA[In a Part I, we began discussing some quantitative evaluations of the technology reported in our research paper.  The goal in these studies is to see if search engine users get any value out of real-time implicit personalization and, if so, to find metrics that we can use to quantify this value. One of the [...]]]></description>
			<content:encoded><![CDATA[<p>In a <a href="http://blog.surfcanyon.com/2008/10/14/evaluating-surf-canyons-technology-part-1/" target="_blank">Part I</a>, we began discussing some quantitative evaluations of the technology reported in our <strong><a href="http://www.surfcanyon.com/SurfCanyonDemonstrationResearchPaper.pdf" target="_blank">research paper</a></strong>.  The goal in these studies is to see if search engine users get any value out of real-time implicit personalization and, if so, to find metrics that we can use to quantify this value.</p>
<p>One of the most useful techniques for comparing the quality of search engine retrieval functions is the technique of result interleaving, invented by <a href="http://www.cs.cornell.edu/People/tj/" target="_blank">Thorsten Joachims</a> of Cornell University. He first introduced the idea in a <a href="http://www.cs.cornell.edu/People/tj/publications/joachims_02b.pdf" target="_blank">2002 paper</a> and has since recently <a href="http://www.cs.cornell.edu/People/tj/publications/radlinski_etal_08b.pdf" target="_blank">expanded</a> on the idea.</p>
<p>A search engine retrieval function is an algorithm that produces a ranked list of documents given a document collection and a user query. The retrieval function is the secret sauce behind the search engine. It is reported that Google, for instance, considers over 200 different document features when ranking web pages in response to a user query. These features are fed into the retrieval function which tells the web application which links to present and it what order.  In an open collection, such as the World Wide Web, different retrieval functions can produce both different document orderings as well as entirely different sets of documents.</p>
<p>Joachims came up with a very simple test that answers the following question: Given two retrieval functions, which does a search engine user prefer? His idea was that one can interleave the results of the two retrieval functions in an unbiased fashion and then count the user clicks on the links contributed by each retrieval function. The better retrieval function is the one that gets the most clicks.</p>
<p>For instance, assume that we have four documents (A, B, C, and D) that are relevant to a given query. According to the first retrieval function, r1, they should be ordered C-A-D-B. According to the second retrieval function, r2, they should be ordered D-A-C-B. The interleaved order of presentation would be D-C-A-B half the time and C-D-A-B half the time. We need to assure that each retrieval function gets to determine the document in the top spot half the time in order to have an unbiased test.  We would then show these document lists to many users as they conduct searches for this query. If we found that there were more user clicks on document C compared to document D, we can state that users prefer retrieval function r1. We can repeat this test for more documents and more queries, but by simply counting the clicks on documents contributed by each retrieval function we can determine an absolute user preference.</p>
<p>Surf Canyon implemented this test to compare our retrieval function, which employs real-time personalization based on implicit relevance feedback, with Google&#8217;s retrieval function. We always show the user the top 10 Google results, even with our application installed, so our interleaving test was only done when users asked for a second page of search results. In those cases, the results presented to the user would be a mixture of results 11 through 15 from Google and the most highly ranked personalized results from Surf Canyon&#8217;s retrieval function.</p>
<p>The figure below shows the ratio of link clicks on Surf Canyon results compared to Google results as a function of the number of search results selected by the searcher before preceding to the second page of results. A ratio less than 1.0 would indicate that they prefer un-personalized Google results, whereas a ratio greater than 1.0 would indicate a preference for Surf Canyon&#8217;s retrieval algorithm. A ratio of 1.0 would indicate no user preference. Note that the users do not know if they are selecting personalized or un-personalized links. The result is a very clear preference for Surf Canyon&#8217;s retrieval algorithm – users are 30-40% more likely to select Surf Canyon links.</p>
<p><img src="http://blog.surfcanyon.com/wp-content/uploads/2008/11/research-paper-fig-6.jpg" alt="Research Paper Fig 6" /><br />
We looked at this quantity versus the number of search results selected because the user&#8217;s interactions with the search page are what we use to personalize the results. The more the user selects, the more confident we are about the user&#8217;s true intent. Interestingly, users also prefer Surf Canyon results, by a significant margin, even when they skip the top 10 Google links entirely. When a user skips a link, we generally assume that the document is not what the user wanted. If the user skips all 10 links, we assume that the search engine misinterpreted the user intent and we start looking for different content that is not represented in the top 10 links.</p>
<p>Even though only 10% of searchers ever venture beyond page 1, we consider a 30-40% improvement in page 2 click-through rates to be significant. Quantitatively measuring the value delivered by real-time implicit personalization to page 1 results is, unfortunately, considerably more difficult. Nevertheless, be believe that these page 2 results are indicative of the value that real-time implicit personalization can delivery to page 1 results as well.</p>
<p>Update (7/15/09) &#8211; Our <a href="http://www.surfcanyon.com/SurfCanyonDemonstrationResearchPaper.pdf" target="_blank">research paper</a>,  “Demonstration of Improved Search Result Relevancy Using Real-Time  Implicit Relevance Feedback,” was selected for oral presentation at <a href="http://blog.surfcanyon.com/2009/07/15/selected-for-oral-presentation-at-sigir-09/" target="_blank">SIGIR &#8217;09</a>.</p>
<p>Update (12/18/09) &#8211; Our <a href="http://www.surfcanyon.com/SurfCanyonDemonstrationResearchPaper.pdf" target="_blank">research paper</a> was published by <a href="http://blog.surfcanyon.com/2009/07/15/selected-for-oral-presentation-at-sigir-09/" target="_blank">SIGIR</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://js-kit.com/rss/blog.surfcanyon.com/p=153</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Evaluating Surf Canyon&#8217;s Technology (Part 1)</title>
		<link>http://blog.surfcanyon.com/2008/10/14/evaluating-surf-canyons-technology-part-1/</link>
		<comments>http://blog.surfcanyon.com/2008/10/14/evaluating-surf-canyons-technology-part-1/#comments</comments>
		<pubDate>Tue, 14 Oct 2008 21:02:49 +0000</pubDate>
		<dc:creator>Surf Canyon</dc:creator>
				<category><![CDATA[Discovery]]></category>
		<category><![CDATA[Personalization]]></category>
		<category><![CDATA[Research]]></category>
		<category><![CDATA[Tutorials]]></category>

		<guid isPermaLink="false">http://blog.surfcanyon.com/2008/10/14/evaluating-surf-canyons-technology-part-1/</guid>
		<description><![CDATA[For the past 2½ years, Surf Canyon has been working to improve the web search experience. In particular, we feel that search results individualized to each user and their current context will prove superior to search results that are not personalized to real-time intent. Our hypothesis is that real-time personalization works and the  challenge for [...]]]></description>
			<content:encoded><![CDATA[<p>For the past 2½ years, <a href="http://www.SurfCanyon.com" target="_blank">Surf Canyon</a> has been working to improve the web search experience. In particular, we feel that search results individualized to each user and their current context will prove superior to search results that are not personalized to real-time intent. Our hypothesis is that real-time personalization works and the  challenge for us is to thus prove this statement, quantify the improvements and use the data that we gather to improve our application even further.</p>
<p>Quantifying the “web search experience” is, however, very challenging. Nevertheless, search engines are constantly running <a href="http://googleblog.blogspot.com/2008/08/search-experiments-large-and-small.html" target="_blank">small (and large) experiments</a> to test how changes affect the user search experience. These experiments, which often use something called “A/B” or “bucket” testing, entail exposing a small, randomly selected set of users to the new features or changes and then comparing their behavior to the behavior of users on the baseline search site. Depending on the feature being tested, different user behavior signals are used to judge user satisfaction with the changes.</p>
<p>Since our technology radically changes the nature of the search results page, evaluating the application is particularly difficult. Once a user installs the Surf Canyon application, the search engine results page becomes dynamic and personalized to each user. Users who install Surf Canyon expect to get the Surf Canyon technology, so a traditional bucket test is not possible. (If we were to have a control sample of users who installed a special version of Surf Canyon that did not personalize their search results, those users would be perplexed and would probably uninstall the product.)</p>
<p>However, we performed a thorough evaluation of the technology using some traditional evaluation metrics from the Information Retrieval community as well as some new evaluation techniques that we invented ourselves. These evaluations are documented in a <strong><a href="http://www.surfcanyon.com/SurfCanyonDemonstrationResearchPaper.pdf" target="_blank">research paper</a></strong> which we recently drafted<a href="http://www.cikm2008.org/" target="_blank"></a>. In a subsequent blog post, we will detail our evaluation methodologies and the conclusions of these studies.</p>
<p>A good presentation should naturally start with the conclusions, so we reveal here in advance the conclusion of our studies so far: real-time personalization works.</p>
<p>Update (12/1/08) &#8211; Continued in <a href="http://blog.surfcanyon.com/2008/12/01/evaluating-surf-canyon%E2%80%99s-technology-part-2/" target="_blank">Part 2</a>.</p>
<p>Update (7/15/09) &#8211; Our <a href="http://www.surfcanyon.com/SurfCanyonDemonstrationResearchPaper.pdf" target="_blank">research paper</a>, “Demonstration of Improved Search Result Relevancy Using Real-Time Implicit Relevance Feedback,” was selected for oral presentation at <a href="http://blog.surfcanyon.com/2009/07/15/selected-for-oral-presentation-at-sigir-09/" target="_blank">SIGIR &#8217;09</a>.</p>
<p>Update (12/18/09) &#8211; Our <a href="http://www.surfcanyon.com/SurfCanyonDemonstrationResearchPaper.pdf" target="_blank">research paper</a> was published by <a href="http://blog.surfcanyon.com/2009/07/15/selected-for-oral-presentation-at-sigir-09/" target="_blank">SIGIR</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://js-kit.com/rss/blog.surfcanyon.com/p=140</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Problem: Too Many Results</title>
		<link>http://blog.surfcanyon.com/2008/07/29/the-problem-too-many-results/</link>
		<comments>http://blog.surfcanyon.com/2008/07/29/the-problem-too-many-results/#comments</comments>
		<pubDate>Wed, 30 Jul 2008 00:31:20 +0000</pubDate>
		<dc:creator>Surf Canyon</dc:creator>
				<category><![CDATA[Discovery]]></category>
		<category><![CDATA[Personalization]]></category>
		<category><![CDATA[Reformulation]]></category>
		<category><![CDATA[Research]]></category>

		<guid isPermaLink="false">http://blog.surfcanyon.com/2008/07/29/the-problem-too-many-results/</guid>
		<description><![CDATA[Cuil just launched their search engine which boasts the largest index on the internet with 120 billion pages. (121,617,892,992 to be precise, as posted on their home page.) While the exact numbers are not always made available, Google, Yahoo! and Microsoft also all have 10s of billions of pages in their indexes. Having as comprehensive [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.cuil.com" target="_blank">Cuil</a> just launched their search engine which boasts the <a href="http://searchengineland.com/080728-000100.php" target="_blank">largest index</a> on the internet with 120 billion pages. (121,617,892,992 to be precise, as posted on their home page.) While the exact numbers are not always made available, Google, Yahoo! and Microsoft also all have 10s of billions of pages in their indexes. Having as comprehensive an index as possible is a fabulous thing, and a very important prerequisite for search since you can&#8217;t find anything if it&#8217;s not in there, but it does not solve the problem of putting 10 relevant results on page one.</p>
<p>In their paper entitled “<a href="http://research.microsoft.com/%7Esdumais/PIA2005-final.pdf" target="_blank">Beyond the Commons</a>: Investigating the Value of Personalizing Web Search,” Teevan et al. make the observation that:</p>
<blockquote><p>&#8220;Web queries are very short, and it is unlikely that a two- or three-word query can unambiguously describe a user’s informational goal.&#8221;</p></blockquote>
<p>Ambiguous intent combined with an exploding quantity of content on the internet makes it increasingly difficult to put all of the relevant results on page one while simultaneously eliminating those that are not pertinent.</p>
<p>Very few people <a href="http://blog.surfcanyon.com/2008/02/18/hidden-treasures/" target="_blank">venture past the first page</a> of search results to find what they want, so returning hundreds of thousands or even millions of results is of little value to the user. (You cannot look past the first 1000 even if you wanted to.) Even if the user is particularly motivated, the process of digging through page after page of results is nothing short of tedious, which is the reason users will either quickly turn to <a href="http://blog.surfcanyon.com/2008/01/16/maybe-if-i-just-add-some-quotes%e2%80%a6/" target="_blank">reformulating their query or abandoning the search</a>.</p>
<p>The problem is too many results!</p>
<p>The solution to the conundrum is to have a greater understanding of the user&#8217;s intent in order to more precisely focus the results. One way to achieve this is to get the user to explicitly specify intent by entering more keywords, although getting people to change behavior is not easy. Another way to achieve this is to implicitly infer intent through the type of long-term personalization offered by Google, although this too has a number <a href="http://blog.surfcanyon.com/2007/09/19/hold-the-pickles-hold-the-lettuce/" target="_blank">shortcomings</a>.</p>
<p>The most effective way to resolve this issue is to implicitly infer intent from real-time behavior signals and then immediately re-rank the results, through the use of instantaneous relevancy calculations, so that the most pertinent results are moved to the top while the less relevant are suppressed. <a href="http://www.SurfCanyon.com" target="_blank">Surf Canyon</a>&#8216;s Discovery for Search<font size="4">™</font> is such a solution. Disambiguating intent &#8220;on the fly&#8221; not only enables users to continue searching with their current behavior, but no search histories or profiles are required. Furthermore, the signals are strong so that the results can be reordered dramatically and the user can actually &#8220;see&#8221; the process working, creating a more encouraging and perhaps entertaining search experience.</p>
]]></content:encoded>
			<wfw:commentRss>http://js-kit.com/rss/blog.surfcanyon.com/p=109</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Surf Canyon Launches Discovery Engine for Search</title>
		<link>http://blog.surfcanyon.com/2008/02/19/surf-canyon-launches-discovery-engine-for-search/</link>
		<comments>http://blog.surfcanyon.com/2008/02/19/surf-canyon-launches-discovery-engine-for-search/#comments</comments>
		<pubDate>Tue, 19 Feb 2008 12:42:41 +0000</pubDate>
		<dc:creator>Surf Canyon</dc:creator>
				<category><![CDATA[Announcements]]></category>
		<category><![CDATA[Discovery]]></category>
		<category><![CDATA[Media]]></category>
		<category><![CDATA[Press Releases]]></category>
		<category><![CDATA[Testimonials]]></category>

		<guid isPermaLink="false">http://blog.surfcanyon.com/2008/02/19/surf-canyon-launches-discovery-engine-for-search/</guid>
		<description><![CDATA[Surf Canyon’s browser extension significantly accelerates the search process, enables searchers to find relevant information buried among results OAKLAND, CALIF. February 19, 2008 – Surf Canyon today announced the official release of its Discovery Engine for Search™ at www.SurfCanyon.com. The browser extension for Internet Explorer and Firefox, designed to disambiguate user queries in real time, [...]]]></description>
			<content:encoded><![CDATA[<p align="center"><em>Surf Canyon’s browser extension significantly accelerates the search process, enables searchers to find relevant information buried among results</em></p>
<p><strong>OAKLAND, CALIF.</strong> February 19, 2008 – Surf Canyon today announced the official release of its <strong>Discovery Engine for Search™</strong><span style="font-size: 8pt; font-family: 'Tahoma','sans-serif'"></span> at <a href="http://www.SurfCanyon.com" target="_blank">www.SurfCanyon.com</a>. The browser extension for Internet Explorer and Firefox, designed to disambiguate user queries in real time, significantly accelerates the search process.</p>
<p>As the volume of content on the internet expands exponentially, so has the challenge of sifting through mountains of search results in order to pinpoint the most relevant. By using Surf Canyon’s semantic real-time implicit personalization technology while searching on Google, Yahoo! or MSN, searchers are able to more quickly and easily find relevant information buried within the usually overwhelming quantity of results. Rather than leaving users to manually filter through static sets of links, this innovative application uses real-time behavior signals to calculate “instantaneous relevancies” within a result set in order to dynamically pull forward the most pertinent results while pushing back those less germane. The technology is similarly suited for targeting sponsored links, benefiting both users and advertisers.</p>
<p>“Like many innovations, the concept was born out of frustration,” said Mark Cramer, Surf Canyon CEO. “While existing search technology is certainly more than adequate for many needs, when looking for something relatively obscure I often felt somewhat alone digging through all the results and, as such, decided that I could benefit from a search technology that would work with me during the process. With our product, search is no longer a static lecture being pushed to the user, but a participatory conversation that yields better and faster results.”</p>
<p>“Surf Canyon significantly improves my search experience and the deeper results it delivers are spot on virtually every time,” said Carla Thompson, senior analyst with The Guidewire Group. “I also really like that it functions within my existing search; once installed, I’m able to enjoy the advantages without having to adjust my habits.”</p>
<p><strong>About Surf Canyon</strong><br />
Surf Canyon develops real-time implicit personalization technology for internet search. Its patents-pending <strong>Discovery Engine for Search™</strong> transforms static result pages into dynamic knowledge resources, enabling users to more quickly and easily find pertinent information buried among all the irrelevant results, significantly accelerating the search process.<br />
Surf Canyon has headquarters in Oakland, California and is recognized by DEMO.com as an “Innovator to Watch.”</p>
]]></content:encoded>
			<wfw:commentRss>http://js-kit.com/rss/blog.surfcanyon.com/p=29</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
		<item>
		<title>Hidden Treasures</title>
		<link>http://blog.surfcanyon.com/2008/02/18/hidden-treasures/</link>
		<comments>http://blog.surfcanyon.com/2008/02/18/hidden-treasures/#comments</comments>
		<pubDate>Mon, 18 Feb 2008 20:25:09 +0000</pubDate>
		<dc:creator>Surf Canyon</dc:creator>
				<category><![CDATA[Discovery]]></category>
		<category><![CDATA[Personalization]]></category>
		<category><![CDATA[Research]]></category>

		<guid isPermaLink="false">http://blog.surfcanyon.com/2008/02/18/hidden-treasures/</guid>
		<description><![CDATA[It has been observed for some time that search engine users rarely venture past the first page of results. Various studies have shown that only 10% of the time does a user click on a result with a rank greater than 10. A typical user would thus rather reformulate the query or abandon the search [...]]]></description>
			<content:encoded><![CDATA[<p>It has been observed for some time that search engine users rarely venture past the first page of results.  Various studies have shown that only 10% of the time does a user click on a result with a rank greater than 10.  A typical user would thus rather reformulate the query or abandon the search when they can&#8217;t find what they need on page one.</p>
<p>This is particularly problematic since, as discussed <a href="http://blog.surfcanyon.com/2007/09/19/hold-the-pickles-hold-the-lettuce/" target="_blank">previously</a>, it’s increasingly difficult, if not impossible, to map two- and three-word queries to the exploding content on the internet. It is thus <a href="http://blog.surfcanyon.com/2008/01/16/maybe-if-i-just-add-some-quotes%e2%80%a6/" target="_blank">increasing unlikely</a> that the top 10 will contain the information sought without some form of <a href="http://blog.surfcanyon.com/2008/02/05/dont-ask-just-do-it/" target="_blank">additional input</a> from the user, either explicitly (reformulation suggestions or clusters) or implicitly (personalization).</p>
<p>But what if there was a search technology that was able to dig into the results and fetch the relevant information for you?</p>
<p>Surf Canyon&#8217;s Discovery for Search™ technology is designed to find the “hidden treasures” in the subsequent search results, and Beta testing has confirmed this. The graph below compares the initial rank of a result (x-axis) with the frequency with which it is clicked (y-axis). The AOL data is from 2006 and the Surf Canyon data is from the most recent 10,000 queries. The result is that Surf Canyon doubles (17% vs. 9%) the users’ ability to find relevant information buried in the search results.</p>
<p><a href="http://blog.surfcanyon.com/wp-content/uploads/2008/02/sc-vs-aol.jpg" title="SC vs AOL"><img src="http://blog.surfcanyon.com/wp-content/uploads/2008/02/sc-vs-aol.jpg" alt="SC vs AOL" /></a></p>
]]></content:encoded>
			<wfw:commentRss>http://js-kit.com/rss/blog.surfcanyon.com/p=27</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Don&#8217;t ask&#8230; just do it!</title>
		<link>http://blog.surfcanyon.com/2008/02/05/dont-ask-just-do-it/</link>
		<comments>http://blog.surfcanyon.com/2008/02/05/dont-ask-just-do-it/#comments</comments>
		<pubDate>Wed, 06 Feb 2008 00:04:48 +0000</pubDate>
		<dc:creator>Surf Canyon</dc:creator>
				<category><![CDATA[Discovery]]></category>
		<category><![CDATA[Personalization]]></category>

		<guid isPermaLink="false">http://blog.surfcanyon.com/2008/02/05/dont-ask-just-do-it/</guid>
		<description><![CDATA[“Personalization,” which is the tailoring of a product to the specific interests and needs of an individual, has been widely heralded as an important web innovation as search engines, and websites in general, work to deliver greater value to users. There are, in general, two types of personalization, explicit and implicit, which must both be [...]]]></description>
			<content:encoded><![CDATA[<p>“Personalization,” which is the tailoring of a product to the specific interests and needs of an individual, has been widely heralded as an important web innovation as search engines, and websites in general, work to deliver greater value to users. There are, in general, two types of personalization, explicit and implicit, which must <em>both </em>be employed to achieve the greatest benefit for the user. Through the examination of some of the limitations of the former, however, we can gain a greater appreciation for the importance of the latter.</p>
<ul>
<li><strong>Explicit personalization</strong>, also referred to as “customization,” is the process through which users shape their experiences by <em>actively </em>selecting preferences or responding to questions. Within search, voting for, rating or deleting particular results, or any adjustment on a ‘Preferences’ or ‘Options’ page, are obvious examples of explicit personalization. Less obvious, however, are things such as presenting suggested search terms or similar queries to the user, such as with <a href="http://tools.search.yahoo.com/newsearch/searchassist" target="_blank">Yahoo! Search Assist</a> or <a href="http://www.google.com/webhp?complete=1&amp;hl=en" target="_blank">Google Suggest</a>. Clustering (the process of suggesting possible query reformulations through the analysis of similarities between documents in the result set) is also a form of explicit personalization. In all of these cases the onus is upon to user to make a determination as to the path forward and then actively select an option.</li>
</ul>
<ul>
<li><strong>Implicit personalization</strong> is the process through which the users’ experiences are shaped by inferences of intent generated through the careful observation of behavior. <a href="http://www.google.com/psearch" target="_blank">Google’s Personalized Search</a>, which delivers a <a href="http://searchengineland.com/070202-224617.php" target="_blank">modified</a> set of results based on past search activity, is a clear example. Leveraging behavioral analysis or social relationships to target anything from advertising to search results to any other sort of content is also implicit personalization.</li>
</ul>
<p>One is not better than the other. They are simply different and, to a great extent, compatible. Explicit personalization by itself, however, suffers from a number of limitations that can be alleviated through the addition of implicit personalization:</p>
<ol>
<li>Explicit personalization, by definition, requires effort on the part of the user and many people merely will not bother. It seems that users want to enter their queries and then simply go straight to the results, <a href="http://searchengineland.com/070921-070852.php" target="_blank">scanning from top to bottom</a>.</li>
<li>Even if users are willing to make the effort, they don’t necessarily know with enough precision what it is that they want. Furthermore, in some cases they might not yet possess enough knowledge to comprehend the clusters or suggestions. While clusters may sometimes help with reformulation, since it is often easier to recognize terms rather than recall them, search is frequently a process of discovery, with each result leading to better understanding of the subject matter and a greater ability to express intent. Knowing all of this <em>a priori</em> is not always possible.</li>
<li>Explicit personalization through a classification or clustering of search results may also create additional work if the user’s intent spans multiple contexts or sub-contexts. Selecting a cluster, for example, will enable the user to ‘zoom in’ on a particular sub-context. However, if the sub-context is not a precise match, or if the nature of the search requires a body of information that covers multiple sub-contexts, then the user will have to expend additional effort going up and down the classification tree.</li>
<li>Even if users think they know what they want, they still might be mistaken. Or worse, either intentionally or not, they are not always <a href="http://online.wsj.com/public/article/SB114961753581872822-rR3vT0iNg5HrPfdR6tdSAbRAZXA_20060711.html?mod=tff_main_tff_top" target="_blank">honest with themselves</a>. Based on the ability to observe user behavior on all of Yahoo!’s properties, <a href="http://money.cnn.com/magazines/business2/business2_archive/2007/03/01/8401043/index.htm?postversion=2007022615" target="_blank">Dr. Usama Fayyad stated</a>, “I know more about your intent than any 1000 keywords you could type,” which of course alludes to the difficulty of both accurately knowing and then expressing one’s own intent.</li>
</ol>
<p>It is for these reasons that implicit personalization is an important compliment to explicit personalization. One could compare it to the focus adjustments of a telescope, with the course adjustment being explicit personalization and the fine adjustment being implicit personalization; you could certainly make out some celestial objects with only the course adjustment, but you need the fine adjustment to really discern the details.</p>
]]></content:encoded>
			<wfw:commentRss>http://js-kit.com/rss/blog.surfcanyon.com/p=26</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Discovering Discovery</title>
		<link>http://blog.surfcanyon.com/2007/10/11/discovering-discovery/</link>
		<comments>http://blog.surfcanyon.com/2007/10/11/discovering-discovery/#comments</comments>
		<pubDate>Thu, 11 Oct 2007 17:43:51 +0000</pubDate>
		<dc:creator>Surf Canyon</dc:creator>
				<category><![CDATA[Discovery]]></category>
		<category><![CDATA[Recommendations]]></category>
		<category><![CDATA[Tutorials]]></category>

		<guid isPermaLink="false">http://blog.surfcanyon.com/2007/10/11/discovering-discovery/</guid>
		<description><![CDATA[Here&#8217;s a quick 1, 2, 3 to help you get the most out of Surf Canyon&#8217;s Discovery Engine for Search™. Step 1: Enter you query at search.SurfCanyon.com or, if you like, download our add-on and then go to your favorite search engine (Google, Yahoo! or Bing), just like you always do, and enter your keywords. [...]]]></description>
			<content:encoded><![CDATA[<p>Here&#8217;s a quick 1, 2, 3 to help you get the most out of Surf Canyon&#8217;s Discovery Engine for Search™.</p>
<p><strong>Step 1:</strong> Enter you query at <a href="http://search.SurfCanyon.com" target="_blank">search.SurfCanyon.com</a> or, if you like, <a href="http://www.surfcanyon.com/extension.jsp" target="_blank">download our add-on</a> and then go to your favorite search engine (Google, Yahoo! or Bing), just like you always do, and enter your keywords.</p>
<p>Our example will use a search for the term “dolphins” since it is often used to illustrate one of the classic problems with information retrieval. No matter how much information is gathered about the subject, the individual, or all the individuals who have ever searched for “dolphins”, it is impossible to know <em>a priori</em> whether or not the searcher’s intent is related to the animal or to the football team. While search engines may suggest alternate queries or reformulations through the use of keyword suggestions or clusters, the search results produced will always be a mix of the animal and the football team.</p>
<p><a href="http://blog.surfcanyon.com/wp-content/uploads/2008/02/discovering-discovery-_1.jpg" title="Football and animal mixed together on results page"><img src="http://blog.surfcanyon.com/wp-content/uploads/2008/02/discovering-discovery-_1.jpg" title="Football and animal mixed together on results page" alt="Football and animal mixed together on results page" align="middle" height="400" width="500" /></a></p>
<p><a href="http://blog.surfcanyon.com/?attachment_id=8" rel="attachment wp-att-8" title="Screen Shot of Query for “Dolphins”"></a><a href="http://blog.surfcanyon.com/wp-content/uploads/2007/10/dolphins-screen-shot.jpg" title="Screen Shot of Query for “Dolphins”"></a></p>
<p><a href="http://blog.surfcanyon.com/wp-content/uploads/2007/10/dolphins-screen-shot.jpg" title="Screen Shot of Query for “Dolphins”"></a></p>
<p><strong>Step 2:</strong> Review the results and select the one that corresponds to your interests. If it&#8217;s the Miami Dolphins that you like, click #2: MiamiDolphins.com. If your search is satisfied, great! If not, return to the search results and Surf Canyon will provide you with some &#8220;Recommendations&#8221;. You may, if you like, also get the &#8220;instant&#8221; Recommendations by simply clicking the bull&#8217;s eye next to the particular link of your choice.</p>
<p><a href="http://blog.surfcanyon.com/wp-content/uploads/2008/02/discovering-discovery-_2.jpg" title="Recommended search results for MiamiDolphins.com"><img src="http://blog.surfcanyon.com/wp-content/uploads/2008/02/discovering-discovery-_2.jpg" title="Recommended search results for MiamiDolphins.com" alt="Recommended search results for MiamiDolphins.com" align="middle" height="400" width="500" /></a></p>
<p>Recommendations are <em>dug out</em> of the search results based on the Discovery Engine&#8217;s inference of our real-time intent. Put another way, Surf Canyon figures out what you want and then goes past the first page of the results and fetches it for you.</p>
<p><strong>Step 3:</strong> The process is cumulative, so continue clicking on links and bull&#8217;s eyes that are of interest to you. The longer you work on the result set, the better Surf Canyon understands your intent and the more accurate are the Recommendations. The Recommendations can nest to three levels, enabling you to &#8220;drill down&#8221; on not just your high-level intent, but your sub-intents as well, such as &#8220;Miami Dolphins news.&#8221;</p>
<p><a href="http://blog.surfcanyon.com/wp-content/uploads/2008/02/discovering-discovery-_3.jpg" title="Second level of Recommendations for Miami Dolphins news"><img src="http://blog.surfcanyon.com/wp-content/uploads/2008/02/discovering-discovery-_3.jpg" title="Second level of Recommendations for Miami Dolphins news" alt="Second level of Recommendations for Miami Dolphins news" align="middle" height="400" width="500" /></a><a href="http://blog.surfcanyon.com/wp-content/uploads/2008/02/discovering-discovery-_1.jpg" title="Football and animal mixed together on results page"> </a></p>
<p>Click on &#8220;More Results&#8221; and you&#8217;ll get a second page of results automatically customized to your intent. In this case, they are naturally all related to the Miami Dolphins football team and are geared towards news.</p>
<p><a href="http://blog.surfcanyon.com/wp-content/uploads/2008/02/discovering-discovery-_4.jpg" title="Subsequent pages of results are customized in real-time"><img src="http://blog.surfcanyon.com/wp-content/uploads/2008/02/discovering-discovery-_4.jpg" title="Subsequent pages of results are customized in real-time" alt="Subsequent pages of results are customized in real-time" align="middle" height="400" width="500" /></a></p>
<p>Keep going and Surf Canyon will keep working to help you find what you need.</p>
<p>That&#8217;s Discovery!</p>
<p>Lastly, don&#8217;t worry about &#8216;mistakes&#8217;: clicking the links that are perhaps not of interest. Surf Canyon is smart enough to handle it.</p>
]]></content:encoded>
			<wfw:commentRss>http://js-kit.com/rss/blog.surfcanyon.com/p=7</wfw:commentRss>
		<slash:comments>15</slash:comments>
		</item>
	</channel>
</rss>

