Multi Process Extraction Doesn't END
Posted: Thu Jan 02, 2014 6:37 pm
Hey,
I created a very simple project that extracts addresses from a list of URLs I created.
Action tree:
Main
Start Process
Extract from each URL
-Navigate URLs from URL table
--Extract ADDRESS KIND
Main2
Execute Action Tree: Start Process
The URL Table looks like this:
ID|URLs
1 |Sex.com
2 |Anus.com
3 |etc.com
4 |peepoo.com
So, the extraction works BUT when I run it, it doesn't end. I was testing with a list of about 50 URLs and expected about 200 addresses in total. The extraction ran and just kept racking up data. At first I thought 50 URLs was simply taking a long time for a multi-process extraction. After the new project had run for about as long as the normal (single-process) extraction, I looked at the results and noticed I had almost 5000. This leads me to believe that even though I had populated the IDs, they were not being used to track which URL had been visited, and all processes were running ALL URLs nonstop. Please help!
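To show what I think is going on, here is a rough Python sketch. It is not how the tool works internally (I don't know that); the function names `run_unshared` and `run_shared_queue` and the `siteN.example` URLs are made up for illustration. The first function is what I suspect is happening: every process walks the whole URL table. The second is what I expected: each row is claimed once from a shared queue, and a process stops when nothing is left.

```python
import queue
import threading


def run_unshared(url_rows, worker_count):
    """Suspected behaviour: every worker walks the whole table,
    so each URL is visited worker_count times."""
    visited = []
    for _ in range(worker_count):
        visited.extend(url_rows)
    return visited


def run_shared_queue(url_rows, worker_count):
    """Expected behaviour: each (id, url) row is claimed by exactly one
    worker, and workers exit once the queue is empty."""
    pending = queue.Queue()
    for row in url_rows:
        pending.put(row)

    visited = []
    lock = threading.Lock()

    def worker():
        while True:
            try:
                row = pending.get_nowait()
            except queue.Empty:
                return  # nothing left to claim, so this worker ends
            with lock:
                visited.append(row)

    threads = [threading.Thread(target=worker) for _ in range(worker_count)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return visited


# Hypothetical table with the same shape as mine: ID | URL
rows = [(i, f"site{i}.example") for i in range(1, 51)]
duplicated = run_unshared(rows, 4)
claimed_once = run_shared_queue(rows, 4)
```

With 4 workers, the first version visits 200 pages for a 50-row table, while the second visits each row once. The first version only explains the duplicates; it still finishes, so the never-ending part would need something else on top, such as the list being restarted.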