I have created a project, but it has a problem: after extracting some pages, it starts getting blank pages.
Kindly check the attached project file and let me know what the problem is.
Help for creating project
Attachments
- infoline-ver-1.hsp (414.93 KiB)
Re: Help for creating project
Hi,
I tested it for 10 pages and it worked just fine. What exactly do you mean by "taking blank pages"? If the problem is that you are getting blank pages when navigating to a page, then the problem is on the server side. It's simply serving you blank pages. This could be because it is detecting too many requests from your IP. The only solution here would be to use proxies.
You can set up Helium Scraper to use proxies from Project -> Proxies and have your Repeat action rotate them.
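To illustrate what rotating proxies means in general, here is a minimal Python sketch. It is not Helium Scraper's API or its internal behaviour. The function name `assign_proxies`, the `requests_per_proxy` parameter, and the placeholder proxy names are all assumptions made up for this example. It only builds a plan of which proxy each page would go through, without making any network requests.

```python
from itertools import cycle


def assign_proxies(urls, proxies, requests_per_proxy=1):
    """Pair each URL with a proxy, moving to the next proxy in
    round-robin order after every `requests_per_proxy` URLs.

    Hypothetical helper for illustration only; it performs no requests.
    """
    if not proxies:
        raise ValueError("at least one proxy is required")
    if requests_per_proxy < 1:
        raise ValueError("requests_per_proxy must be at least 1")

    pool = cycle(proxies)      # endless round-robin over the proxy list
    current = next(pool)
    plan = []
    for index, url in enumerate(urls):
        # Switch to the next proxy once the current one has served its quota.
        if index > 0 and index % requests_per_proxy == 0:
            current = next(pool)
        plan.append((url, current))
    return plan


if __name__ == "__main__":
    # Placeholder page and proxy names, not real addresses.
    pages = ["page1", "page2", "page3", "page4", "page5"]
    for url, proxy in assign_proxies(pages, ["proxyA", "proxyB"], 2):
        print(url, "via", proxy)
```

Spreading requests this way means each IP address sends fewer requests to the server, which is why rotation can help when a site starts serving blank pages to a busy IP.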
Juan Soldi
The Helium Scraper Team
Re: Help for creating project
Thanks, yes, you are right: the blank pages were due to a problem with the site.
Now it is working fine. The only problem is that after 10-12 pages the scraper slows down,
and the extraction process becomes very slow and takes a lot of time to extract data.
Kindly help me with this problem.
Thanks
Regards
dcmubmai
Re: Help for creating project
Hi,
Again, this is because the server gets slower at serving pages. You might want to try setting a shorter navigation timeout at Project -> Options. Whenever that timeout is reached, Helium Scraper will act as if the page has finished loading, even if it has not.
Just make sure that it is not the whole page, but only a section you are not interested in, that is taking too long to finish loading. If, when navigating to one of these pages, the browser stays on the previous page or shows an empty page for a long time, then it is the whole page that is taking too long to load, and another solution, such as using proxies, will be required.
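The idea behind a navigation timeout can be sketched in Python as follows. This is a generic illustration, not how Helium Scraper implements it. The function `wait_for_page`, its parameters, and the `is_loaded` callback are hypothetical names chosen for this example.

```python
import time


def wait_for_page(is_loaded, timeout_s, poll_s=0.1):
    """Poll `is_loaded()` until it returns True or `timeout_s` seconds pass.

    Returns True if the page reported itself as loaded, or False if the
    timeout was reached first. In the False case the caller carries on
    with whatever content has arrived so far.
    """
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        if is_loaded():
            return True
        time.sleep(poll_s)
    return False


if __name__ == "__main__":
    # Simulated page that never finishes loading (for example, a slow ad block).
    finished = wait_for_page(lambda: False, timeout_s=0.5)
    print("fully loaded" if finished else "timed out, continuing with partial page")
```

A shorter timeout only helps when the data you want has already arrived and something else on the page is still loading. If nothing has arrived yet, continuing early just produces an empty result.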
Juan Soldi
The Helium Scraper Team