<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Statistically improbable phrases</title>
	<atom:link href="http://www.s-anand.net/blog/statistically-improbable-phrases/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.s-anand.net/blog/statistically-improbable-phrases/</link>
	<description>Technology, business and fun</description>
	<lastBuildDate>Mon, 08 Mar 2010 12:25:16 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: DMac</title>
		<link>http://www.s-anand.net/blog/statistically-improbable-phrases/comment-page-1/#comment-1602</link>
		<dc:creator>DMac</dc:creator>
		<pubDate>Wed, 23 Aug 2006 18:00:00 +0000</pubDate>
		<guid isPermaLink="false">http://localhost/blog/statistically-improbable-phrases/#comment-1602</guid>
		<description>This is great. Can you be more specific on how you did this? You say that the words listed are &quot;common&quot; words that appear more than 10 times more often - what is your criterion for &quot;common&quot;? Also, what were the ranges of improbability and oftenness that you mapped into the size and color of the results? How are improbability and oftenness different, anyway? Finally, how did you handle any words in C&amp;H that didn&#039;t appear in your corpus? I&#039;m very interested in hearing more from you about how you did this - I&#039;m looking forward to hearing from you. Best regards...</description>
		<content:encoded><![CDATA[<p>This is great. Can you be more specific on how you did this? You say that the words listed are &#8220;common&#8221; words that appear more than 10 times more often &#8211; what is your criterion for &#8220;common&#8221;? Also, what were the ranges of improbability and oftenness that you mapped into the size and color of the results? How are improbability and oftenness different, anyway? Finally, how did you handle any words in C&#038;H that didn&#8217;t appear in your corpus? I&#8217;m very interested in hearing more from you about how you did this &#8211; I&#8217;m looking forward to hearing from you. Best regards&#8230;</p>
]]></content:encoded>
	</item>
</channel>
</rss>
