Extracting from list of URL's

Questions and answers about anything related to Helium Scraper
Post Reply
danahotpanty
Posts: 4
Joined: Tue Jun 19, 2012 2:47 pm

Extracting from list of URL's

Post by danahotpanty » Tue Jun 19, 2012 3:18 pm

Dear all ,

Please help me to extract information from a list of url's .I spend about one day trying to extract a list of URL's
I have use Navigate URL's , urls's imported into database but nothing etc.. The application navigate all URL's perfectly but extract only the information from the first URL .

How will look the Actions Tree in order to extract all information from all links . I mention that All links have information structured identic and the links have the follow structure :

http://www.xxxxx.com/profile.php?id=1
http://www.xxxxx.com/profile.php?id=5
http://www.xxxxx.com/profile.php?id=12
http://www.xxxxx.com/profile.php?id=116
http://www.xxxxx.com/profile.php?id=120

etc

webmaster
Site Admin
Posts: 521
Joined: Mon Dec 06, 2010 8:39 am
Contact:

Re: Extracting from list of URL's

Post by webmaster » Wed Jun 20, 2012 3:08 am

Hi,

I've just added this FAQ's post. Please follow the steps described on the "My project is not extracting any / enough data." section. Let me know if anything is not 100% clear.
Juan Soldi
The Helium Scraper Team

danahotpanty
Posts: 4
Joined: Tue Jun 19, 2012 2:47 pm

Re: Extracting from list of URL's

Post by danahotpanty » Wed Jun 20, 2012 10:23 am

Thanks webmaster ,

But this is not helping me so much . The problem is that i do not know how to construct the Actions Tree :
I want something like :
- parse first url > extract data
- parse second url > extract data
- parse third url > extract data

I don't know what i need to use > Extracting and then Navigate URL's or using links into a database .....

Could you please help me with that .hsp template example ? please

Dana

webmaster
Site Admin
Posts: 521
Joined: Mon Dec 06, 2010 8:39 am
Contact:

Re: Extracting from list of URL's

Post by webmaster » Wed Jun 20, 2012 3:28 pm

You're probably putting the Extract action after the Navigate URLs one, while it should be a children of it. To add it as a children, select the Navigate URLs action, the right click and select New Action -> Extract.
Juan Soldi
The Helium Scraper Team

danahotpanty
Posts: 4
Joined: Tue Jun 19, 2012 2:47 pm

Re: Extracting from list of URL's

Post by danahotpanty » Wed Jun 20, 2012 6:39 pm

I have try to do that but not working .

Could i send you my hsp to take a look , it is possible ?

I would be grateful

Dana

webmaster
Site Admin
Posts: 521
Joined: Mon Dec 06, 2010 8:39 am
Contact:

Re: Extracting from list of URL's

Post by webmaster » Wed Jun 20, 2012 11:38 pm

Yes, you should be able to attach files in a reply. Or you could use a service such as this one.
Juan Soldi
The Helium Scraper Team

danahotpanty
Posts: 4
Joined: Tue Jun 19, 2012 2:47 pm

Re: Extracting from list of URL's

Post by danahotpanty » Thu Jun 21, 2012 12:10 pm

I have send you PM with my .hsp .

Thanks

Post Reply