<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Evernote Indexing System</title>
	<atom:link href="http://blog.evernote.com/tech/2011/09/30/evernote-indexing-system/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.evernote.com/tech/2011/09/30/evernote-indexing-system/</link>
	<description>The Care and Feeding of Elephants</description>
	<lastBuildDate>Thu, 23 May 2013 14:43:46 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.5.1</generator>
	<item>
		<title>By: Owyn</title>
		<link>http://blog.evernote.com/tech/2011/09/30/evernote-indexing-system/#comment-515</link>
		<dc:creator>Owyn</dc:creator>
		<pubDate>Tue, 04 Oct 2011 23:17:29 +0000</pubDate>
		<guid isPermaLink="false">http://blog.evernote.com/tech/?p=180#comment-515</guid>
		<description><![CDATA[Alex. This has been a big help in clarifying my understanding of how Evernote works.

Thanks.]]></description>
		<content:encoded><![CDATA[<p>Alex. This has been a big help in clarifying my understanding of how Evernote works.</p>
<p>Thanks.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Alex Pashintsev</title>
		<link>http://blog.evernote.com/tech/2011/09/30/evernote-indexing-system/#comment-514</link>
		<dc:creator>Alex Pashintsev</dc:creator>
		<pubDate>Tue, 04 Oct 2011 22:19:12 +0000</pubDate>
		<guid isPermaLink="false">http://blog.evernote.com/tech/?p=180#comment-514</guid>
		<description><![CDATA[&gt;&gt; What is the minimum change required to a note to force a re-index of its resources?

Actually, it is not the note change that matters, but the *resource* itself. The &#039;fingerprint&#039; is stored separately for each of the resources, so if you want an image to be indexed anew, it should be actually different -- even if in a single bit. Changing the note that contains it will have no effect. 

&gt;&gt; I do understand that processing time for a resource will add to total transaction processing time.

Yes - while it is just seconds for an average image, it will certainly be much longer for a huge multi-page PDF.]]></description>
		<content:encoded><![CDATA[<p>&gt;&gt; What is the minimum change required to a note to force a re-index of its resources?</p>
<p>Actually, it is not the note change that matters, but the *resource* itself. The &#8216;fingerprint&#8217; is stored separately for each of the resources, so if you want an image to be indexed anew, it should be actually different &#8212; even if in a single bit. Changing the note that contains it will have no effect. </p>
<p>&gt;&gt; I do understand that processing time for a resource will add to total transaction processing time.</p>
<p>Yes &#8211; while it is just seconds for an average image, it will certainly be much longer for a huge multi-page PDF.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Owyn</title>
		<link>http://blog.evernote.com/tech/2011/09/30/evernote-indexing-system/#comment-513</link>
		<dc:creator>Owyn</dc:creator>
		<pubDate>Tue, 04 Oct 2011 13:34:09 +0000</pubDate>
		<guid isPermaLink="false">http://blog.evernote.com/tech/?p=180#comment-513</guid>
		<description><![CDATA[PPS: Guessing 3 sigma for premium users of about 5 minutes, excepting any abnormal site maintenance problems.]]></description>
		<content:encoded><![CDATA[<p>PPS: Guessing 3 sigma for premium users of about 5 minutes, excepting any abnormal site maintenance problems.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Owyn</title>
		<link>http://blog.evernote.com/tech/2011/09/30/evernote-indexing-system/#comment-512</link>
		<dc:creator>Owyn</dc:creator>
		<pubDate>Tue, 04 Oct 2011 13:24:58 +0000</pubDate>
		<guid isPermaLink="false">http://blog.evernote.com/tech/?p=180#comment-512</guid>
		<description><![CDATA[PS: I do understand that processing time for a resource will add to total transaction processing time. However, the worst case example that Dave Engberg mentioned in his ETC presentation was, IIRC, about an hour for a specific PDF.]]></description>
		<content:encoded><![CDATA[<p>PS: I do understand that processing time for a resource will add to total transaction processing time. However, the worst case example that Dave Engberg mentioned in his ETC presentation was, IIRC, about an hour for a specific PDF.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Owyn</title>
		<link>http://blog.evernote.com/tech/2011/09/30/evernote-indexing-system/#comment-511</link>
		<dc:creator>Owyn</dc:creator>
		<pubDate>Tue, 04 Oct 2011 13:18:18 +0000</pubDate>
		<guid isPermaLink="false">http://blog.evernote.com/tech/?p=180#comment-511</guid>
		<description><![CDATA[My example of a null change got trashed by your html filter.

e.g. [space][backspace]]]></description>
		<content:encoded><![CDATA[<p>My example of a null change got trashed by your html filter.</p>
<p>e.g. [space][backspace]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Owyn</title>
		<link>http://blog.evernote.com/tech/2011/09/30/evernote-indexing-system/#comment-510</link>
		<dc:creator>Owyn</dc:creator>
		<pubDate>Tue, 04 Oct 2011 13:14:53 +0000</pubDate>
		<guid isPermaLink="false">http://blog.evernote.com/tech/?p=180#comment-510</guid>
		<description><![CDATA[Thanks for the response. Very helpful.

Bit confused about the queue management as described. It suggests LIFO for Premium and FIFO for Free.

The service times as described suggest service time distribution of 1 sigma of 2 minutes, 2 sigma of 15 minutes, and 3 sigma of perhaps 2 hours. However, I have seen frequent comments on forum about unsearchable and therefore unindexed resources that have lasted days. Some of those are explainable by the user not understanding the desktop client 2 sync requirement to see the indexing. Some are not.

What is the minimum change required to a note to force a re-index of its resources? Something like touching the note body with a null change? e.g .

Wish there was a more direct attribute that could be searched/tested to determine a notes index status, however...

Thanks again for the very useful information.]]></description>
		<content:encoded><![CDATA[<p>Thanks for the response. Very helpful.</p>
<p>Bit confused about the queue management as described. It suggests LIFO for Premium and FIFO for Free.</p>
<p>The service times as described suggest service time distribution of 1 sigma of 2 minutes, 2 sigma of 15 minutes, and 3 sigma of perhaps 2 hours. However, I have seen frequent comments on forum about unsearchable and therefore unindexed resources that have lasted days. Some of those are explainable by the user not understanding the desktop client 2 sync requirement to see the indexing. Some are not.</p>
<p>What is the minimum change required to a note to force a re-index of its resources? Something like touching the note body with a null change? e.g .</p>
<p>Wish there was a more direct attribute that could be searched/tested to determine a notes index status, however&#8230;</p>
<p>Thanks again for the very useful information.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Alex Pashintsev</title>
		<link>http://blog.evernote.com/tech/2011/09/30/evernote-indexing-system/#comment-509</link>
		<dc:creator>Alex Pashintsev</dc:creator>
		<pubDate>Tue, 04 Oct 2011 00:21:04 +0000</pubDate>
		<guid isPermaLink="false">http://blog.evernote.com/tech/?p=180#comment-509</guid>
		<description><![CDATA[The post was intended to be more of a technical design overview, thus did not focus so much on the usage side -- but let&#039;s touch it too. 
To your questions:
-- free and premium queues are actually the same queue, only that premium images are inserted in the front, instead of being placed in the end of it. 
-- so if the indexing system is not too busy, there may be not much of a difference in processing time - in both cases image will pass the full cycle in a matter of minutes. But if there is a spike of user activity, the reco queue may span an hour or more. In this case premium images will still be processed in couple of minutes - while regular submissions will have to wait.
-- to check if a note was already indexed, sync your desktop EN client once again in a few minutes after you created the note. For PDF notes, check for the option &#039;Save Searchable PDF&#039; when you right-click the PDF object in question. If this line is present in the menu, the note was already indexed. For image notes, choose to &#039;Export&#039; the note to the &#039;archive&#039; .enex format. This will produce an XML file that you can open with a text editor or your browser. Look in the end for the tag  which wraps the image indexing results - if it is present, image was processed and is searchable now.
-- once a note is created and synced up to the Service, its resources are processed according to the user settings at the moment. This includes language preferences, premium status, etc. If any of those are changed after the note was synced up, it will not affect the processing. The note&#039;s images will still stay in the end of the reco queue if the user status changes to &#039;premium&#039;. They will still be recognized as English-only even if the user preferences change to &#039;Japanese+English&#039; from &#039;English&#039; while the resources are still in queue. 
N.B. EN Service keeps the &#039;fingerprints&#039; of all images and PDFs in the system to avoid storing multiple copies of the same file. So even after the language preferences were changed, same image included into a different note will just assume personality of its first copy -- including its reco index. So to have the index change to a different language, it will be required to provide a slightly modified version of the image, or request all nothes reindexing in the user personal settings.]]></description>
		<content:encoded><![CDATA[<p>The post was intended to be more of a technical design overview, thus did not focus so much on the usage side &#8212; but let&#8217;s touch it too.<br />
To your questions:<br />
&#8211; free and premium queues are actually the same queue, only that premium images are inserted in the front, instead of being placed in the end of it.<br />
&#8211; so if the indexing system is not too busy, there may be not much of a difference in processing time &#8211; in both cases image will pass the full cycle in a matter of minutes. But if there is a spike of user activity, the reco queue may span an hour or more. In this case premium images will still be processed in couple of minutes &#8211; while regular submissions will have to wait.<br />
&#8211; to check if a note was already indexed, sync your desktop EN client once again in a few minutes after you created the note. For PDF notes, check for the option &#8216;Save Searchable PDF&#8217; when you right-click the PDF object in question. If this line is present in the menu, the note was already indexed. For image notes, choose to &#8216;Export&#8217; the note to the &#8216;archive&#8217; .enex format. This will produce an XML file that you can open with a text editor or your browser. Look in the end for the tag  which wraps the image indexing results &#8211; if it is present, image was processed and is searchable now.<br />
&#8211; once a note is created and synced up to the Service, its resources are processed according to the user settings at the moment. This includes language preferences, premium status, etc. If any of those are changed after the note was synced up, it will not affect the processing. The note&#8217;s images will still stay in the end of the reco queue if the user status changes to &#8216;premium&#8217;. They will still be recognized as English-only even if the user preferences change to &#8216;Japanese+English&#8217; from &#8216;English&#8217; while the resources are still in queue.<br />
N.B. EN Service keeps the &#8216;fingerprints&#8217; of all images and PDFs in the system to avoid storing multiple copies of the same file. So even after the language preferences were changed, same image included into a different note will just assume personality of its first copy &#8212; including its reco index. So to have the index change to a different language, it will be required to provide a slightly modified version of the image, or request all nothes reindexing in the user personal settings.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Owyn</title>
		<link>http://blog.evernote.com/tech/2011/09/30/evernote-indexing-system/#comment-508</link>
		<dc:creator>Owyn</dc:creator>
		<pubDate>Sun, 02 Oct 2011 21:20:52 +0000</pubDate>
		<guid isPermaLink="false">http://blog.evernote.com/tech/?p=180#comment-508</guid>
		<description><![CDATA[Ummm. Fixing typos in previous post.

Good article, however, it does not cover several of the most important issues from a user’s perspective.

- How are the Free and Premium user queues managed?
- What are typical service times for the queues?
- How can a user determine if a note&#039;s resources have been indexed?
- What happens to pending unindexed resource requests when a user upgrades from free to premium?]]></description>
		<content:encoded><![CDATA[<p>Ummm. Fixing typos in previous post.</p>
<p>Good article, however, it does not cover several of the most important issues from a user’s perspective.</p>
<p>- How are the Free and Premium user queues managed?<br />
- What are typical service times for the queues?<br />
- How can a user determine if a note&#8217;s resources have been indexed?<br />
- What happens to pending unindexed resource requests when a user upgrades from free to premium?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Owyn</title>
		<link>http://blog.evernote.com/tech/2011/09/30/evernote-indexing-system/#comment-507</link>
		<dc:creator>Owyn</dc:creator>
		<pubDate>Sun, 02 Oct 2011 21:00:56 +0000</pubDate>
		<guid isPermaLink="false">http://blog.evernote.com/tech/?p=180#comment-507</guid>
		<description><![CDATA[Good article, unfortunately, it does not cover several of the most important issues from a user&#039;s perspective.

- How are the Free and Premium user queues managed
- What are typical service times for the queues
- How can a user determine if a note resource has been indexed
- What happens to pending unindexed resource requests when a user upgrades from to premium.]]></description>
		<content:encoded><![CDATA[<p>Good article, unfortunately, it does not cover several of the most important issues from a user&#8217;s perspective.</p>
<p>- How are the Free and Premium user queues managed<br />
- What are typical service times for the queues<br />
- How can a user determine if a note resource has been indexed<br />
- What happens to pending unindexed resource requests when a user upgrades from to premium.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
