<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Create your First Web Scraper to Extract Data from a Web Page</title>
	<atom:link href="http://blog.outwit.com/?feed=rss2&#038;p=55" rel="self" type="application/rss+xml" />
	<link>http://blog.outwit.com/?p=55</link>
	<description>Harvest the Web</description>
	<lastBuildDate>Wed, 25 Aug 2010 16:22:12 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
	<item>
		<title>By: jcc</title>
		<link>http://blog.outwit.com/?p=55&#038;cpage=1#comment-1964</link>
		<dc:creator>jcc</dc:creator>
		<pubDate>Mon, 17 May 2010 07:58:13 +0000</pubDate>
		<guid isPermaLink="false">http://blog.outwit.com/?p=55#comment-1964</guid>
		<description>@Jim: The Apply to URL in the scraper can be a partial URL, or a regExp or just a string to be found in the current URL for the program to decide if the scraper applies to the page. The idea is that once on a page, simply going to the &#039;scraped&#039; view will execute the process. Imagine you put &#039;mySite.com/search?&#039; in the apply to URL, then the scraper will apply to any result pages on mySite ; in this case, we cannot load the page as we only have a partial URL. Macros, however will allow this in the Pro version.</description>
		<content:encoded><![CDATA[<p>@Jim: The Apply to URL in the scraper can be a partial URL, or a regExp or just a string to be found in the current URL for the program to decide if the scraper applies to the page. The idea is that once on a page, simply going to the &#8216;scraped&#8217; view will execute the process. Imagine you put &#8216;mySite.com/search?&#8217; in the apply to URL, then the scraper will apply to any result pages on mySite ; in this case, we cannot load the page as we only have a partial URL. Macros, however will allow this in the Pro version.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jcc</title>
		<link>http://blog.outwit.com/?p=55&#038;cpage=1#comment-1963</link>
		<dc:creator>jcc</dc:creator>
		<pubDate>Mon, 17 May 2010 07:46:43 +0000</pubDate>
		<guid isPermaLink="false">http://blog.outwit.com/?p=55#comment-1963</guid>
		<description>For this, you can use the right arrow (next) or the double right arrow (browse) if they are active. You can also collect a list of URLs in your catch for instance, select them and use the right click menu option &#039;Browse through selected URLs&#039;.

(NOTE: For technical questions, please, do use the feedback link rather than posting a comment to the blog for support tickets to be followed.)</description>
		<content:encoded><![CDATA[<p>For this, you can use the right arrow (next) or the double right arrow (browse) if they are active. You can also collect a list of URLs in your catch for instance, select them and use the right click menu option &#8216;Browse through selected URLs&#8217;.</p>
<p>(NOTE: For technical questions, please, do use the feedback link rather than posting a comment to the blog for support tickets to be followed.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jim</title>
		<link>http://blog.outwit.com/?p=55&#038;cpage=1#comment-1960</link>
		<dc:creator>Jim</dc:creator>
		<pubDate>Sun, 16 May 2010 04:04:05 +0000</pubDate>
		<guid isPermaLink="false">http://blog.outwit.com/?p=55#comment-1960</guid>
		<description>One other observation:

since the scraper saves the target URL, it seems it might be a good idea to have it go to that URL automatically when executed rather than having to copy and paste it into the address field .. ?</description>
		<content:encoded><![CDATA[<p>One other observation:</p>
<p>since the scraper saves the target URL, it seems it might be a good idea to have it go to that URL automatically when executed rather than having to copy and paste it into the address field .. ?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jim</title>
		<link>http://blog.outwit.com/?p=55&#038;cpage=1#comment-1959</link>
		<dc:creator>Jim</dc:creator>
		<pubDate>Sun, 16 May 2010 03:57:47 +0000</pubDate>
		<guid isPermaLink="false">http://blog.outwit.com/?p=55#comment-1959</guid>
		<description>LOVE this app! Thanks!

Is there a way to catch all of the return from a paged view?
For example, I am working with a search which returns 50 results per page .. but 250 pages.

I&#039;m not quite sure how to have the scraper iterate thru all of the pages.

Can you help?</description>
		<content:encoded><![CDATA[<p>LOVE this app! Thanks!</p>
<p>Is there a way to catch all of the return from a paged view?<br />
For example, I am working with a search which returns 50 results per page .. but 250 pages.</p>
<p>I&#8217;m not quite sure how to have the scraper iterate thru all of the pages.</p>
<p>Can you help?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: pr</title>
		<link>http://blog.outwit.com/?p=55&#038;cpage=1#comment-1948</link>
		<dc:creator>pr</dc:creator>
		<pubDate>Thu, 22 Apr 2010 06:32:48 +0000</pubDate>
		<guid isPermaLink="false">http://blog.outwit.com/?p=55#comment-1948</guid>
		<description>Hi Yigal,

Since the last version, it&#039;s possible to keep HTML tags in scrapers (also in tables, lists). You&#039;ll see a &quot;Clean data&quot; checkbox in the bottom panel, just uncheck it.
We&#039;ll probably work later on getting text in the email view.

Cheers,</description>
		<content:encoded><![CDATA[<p>Hi Yigal,</p>
<p>Since the last version, it&#8217;s possible to keep HTML tags in scrapers (also in tables, lists). You&#8217;ll see a &#8220;Clean data&#8221; checkbox in the bottom panel, just uncheck it.<br />
We&#8217;ll probably work later on getting text in the email view.</p>
<p>Cheers,</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: jcc</title>
		<link>http://blog.outwit.com/?p=55&#038;cpage=1#comment-1874</link>
		<dc:creator>jcc</dc:creator>
		<pubDate>Fri, 26 Mar 2010 09:33:33 +0000</pubDate>
		<guid isPermaLink="false">http://blog.outwit.com/?p=55#comment-1874</guid>
		<description>Hello and thanks. (Please, do use our feedback form in outwit.com for this type or message, it allows us to keep our bug/wish list up to date.)
We are replacing returns by &quot;;&quot; (after having replaced all &quot;;&quot; by &quot;,&quot; ), to make cut and paste to a spreadsheet easier. It is not a completely satisfying solution but it is a better choice in most cases than simply removing the returns, as it allows you to recover a little of the original layout in the destination program, replacing back &quot;;&quot; by returns. We are adding a &quot;clean text&quot; option (checked by default) in all views of the next version. It means that, if you uncheck it, you will be able to keep html tags like &lt;br&gt; in the scraped text.

Actually, this gave us a pretty simple idea: we will add a find/replace function in all datasheets of a next version. It should help in these cases.

Cheers,

JC</description>
		<content:encoded><![CDATA[<p>Hello and thanks. (Please, do use our feedback form in outwit.com for this type or message, it allows us to keep our bug/wish list up to date.)<br />
We are replacing returns by &#8220;;&#8221; (after having replaced all &#8220;;&#8221; by &#8220;,&#8221; ), to make cut and paste to a spreadsheet easier. It is not a completely satisfying solution but it is a better choice in most cases than simply removing the returns, as it allows you to recover a little of the original layout in the destination program, replacing back &#8220;;&#8221; by returns. We are adding a &#8220;clean text&#8221; option (checked by default) in all views of the next version. It means that, if you uncheck it, you will be able to keep html tags like &lt;br&gt; in the scraped text.</p>
<p>Actually, this gave us a pretty simple idea: we will add a find/replace function in all datasheets of a next version. It should help in these cases.</p>
<p>Cheers,</p>
<p>JC</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: ; in place of Line Breaks</title>
		<link>http://blog.outwit.com/?p=55&#038;cpage=1#comment-1872</link>
		<dc:creator>; in place of Line Breaks</dc:creator>
		<pubDate>Thu, 25 Mar 2010 19:31:48 +0000</pubDate>
		<guid isPermaLink="false">http://blog.outwit.com/?p=55#comment-1872</guid>
		<description>I see &quot;;&quot; in place of all the line breaks when using the scrapers. How do we avoid that or remove the &quot;;&quot;? By the way, great tool!</description>
		<content:encoded><![CDATA[<p>I see &#8220;;&#8221; in place of all the line breaks when using the scrapers. How do we avoid that or remove the &#8220;;&#8221;? By the way, great tool!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Yigal</title>
		<link>http://blog.outwit.com/?p=55&#038;cpage=1#comment-1825</link>
		<dc:creator>Yigal</dc:creator>
		<pubDate>Sun, 21 Feb 2010 00:03:28 +0000</pubDate>
		<guid isPermaLink="false">http://blog.outwit.com/?p=55#comment-1825</guid>
		<description>Hi,

Thanks for the amazing program. I do miss one feature though.
The emails view is good, but lacks the text that is displayed for those emails (Usually the name of the person for which this email belongs).

It could be solved either in the emails view or in the scrapper. But the scrapper lacks the ability to catch the HTML tags, so I can catch the name of the person but not its email.

I would be grateful if you could add one of these features.</description>
		<content:encoded><![CDATA[<p>Hi,</p>
<p>Thanks for the amazing program. I do miss one feature though.<br />
The emails view is good, but lacks the text that is displayed for those emails (Usually the name of the person for which this email belongs).</p>
<p>It could be solved either in the emails view or in the scrapper. But the scrapper lacks the ability to catch the HTML tags, so I can catch the name of the person but not its email.</p>
<p>I would be grateful if you could add one of these features.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: How to Extract Any Web Page Information and Export it to Excel</title>
		<link>http://blog.outwit.com/?p=55&#038;cpage=1#comment-1180</link>
		<dc:creator>How to Extract Any Web Page Information and Export it to Excel</dc:creator>
		<pubDate>Sun, 20 Sep 2009 22:53:04 +0000</pubDate>
		<guid isPermaLink="false">http://blog.outwit.com/?p=55#comment-1180</guid>
		<description>[...] is a detailed info on creating your first scraper as well as the post where I found this cool [...]</description>
		<content:encoded><![CDATA[<p>[...] is a detailed info on creating your first scraper as well as the post where I found this cool [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: OutWit Of hub - We Export Any Information From The Web Pages InTo Excel &#124; Seo Sheen</title>
		<link>http://blog.outwit.com/?p=55&#038;cpage=1#comment-996</link>
		<dc:creator>OutWit Of hub - We Export Any Information From The Web Pages InTo Excel &#124; Seo Sheen</dc:creator>
		<pubDate>Fri, 14 Aug 2009 06:33:34 +0000</pubDate>
		<guid isPermaLink="false">http://blog.outwit.com/?p=55#comment-996</guid>
		<description>[...] because in it there is a possibility of writing its rules for [parsing] - Create of your Of first Of scraper (instruction with the [...]</description>
		<content:encoded><![CDATA[<p>[...] because in it there is a possibility of writing its rules for [parsing] &#8211; Create of your Of first Of scraper (instruction with the [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>
