Search found 502 matches

by webmaster
Sat Aug 29, 2020 2:53 am
Forum: Templates
Topic: AliExpress
Replies: 0
Views: 69

AliExpress

To use this template, place it on an empty folder and open it with Helium Scraper 3. Check this video for a quick walkthrough. The template has two globals that extract different pieces of information: ProductLinks : This global extracts top-level information without visiting any actual product page...
by webmaster
Sat Jun 13, 2020 5:10 pm
Forum: Q/A
Topic: Way Scroll Element Inside Page?
Replies: 1
Views: 1183

Re: Way Scroll Element Inside Page?

All scrolling functions scroll the currently selected element, which defaults to the whole page. So if you select the scrollable element it will scroll that element instead of the whole page: Select.ScrollableElement InfiniteScroll · Select.ListItem · 1000 · true To select the scrollable element, ju...
by webmaster
Tue Jun 09, 2020 9:12 pm
Forum: Q/A
Topic: Common Crawler: is it possible to download the found html files?
Replies: 2
Views: 820

Re: Common Crawler: is it possible to download the found html files?

We've just updated Common Crawler to include the Sequence.WriteFile function. If you don't get an update prompt, this may be because we've migrated the publish location to AWS. If so, just uninstall it and reinstall it from here . Once you have the latest version (3.2.4.9) you can do this to save th...
by webmaster
Tue May 19, 2020 10:31 pm
Forum: Q/A
Topic: SQLite
Replies: 2
Views: 1112

Re: SQLite

There's no way to automatically flush the data, but you can save the project with File -> Save while it's still running without having to stop the extraction. Regarding the dot, not 100% sure about this but I think you can use brackets like "[Some.Thing]" in Access. Anyway, it'd make sense to be abl...
by webmaster
Tue May 19, 2020 10:27 pm
Forum: Q/A
Topic: Saving full HTML of URLs
Replies: 1
Views: 967

Re: Saving full HTML of URLs

You can use Gather.HTML to get the current page HTML (or the HTML of any particular element when the element is selected), and since version 3.2.4.8 you can use Sequence.WriteFile to write files with arbitrary text content. In your case, you could do something like this, supposing all the pages you'...
by webmaster
Sat Apr 11, 2020 2:04 am
Forum: Q/A
Topic: How To Change User Agent in Helium Scraper ???
Replies: 3
Views: 7846

Re: How To Change User Agent in Helium Scraper ???

Since version 3.2.4.4 you can override the user agent at Project -> Settings -> User Agent.
by webmaster
Thu Mar 05, 2020 7:53 pm
Forum: Premades
Topic: Meta Data
Replies: 0
Views: 3369

Meta Data

Import this function to extract meta data . To use this on your project, follow these instructions . { "api": "1.0.2.1", "name": "MetaData", "parameters": { "name": { "caption": "Name", "type": "text", "default": "description" } } } function (name) Browser.EvalScript · "try { return document.querySe...
by webmaster
Tue Jan 21, 2020 4:51 am
Forum: Q/A
Topic: Select different kind of elements with one selector
Replies: 1
Views: 1722

Re: Select different kind of elements

There are a couple of ways to do this. One is to create a selector for each kind of element, and then use the + operator whenever you want to select any of the elements: + · Select.ButtonA · Select.ButtonP If you're using a wizard you will only be able to select a specific selector, so just select a...
by webmaster
Sun Dec 15, 2019 4:23 pm
Forum: Announcements
Topic: Infatica and Helium Scraper Integration
Replies: 0
Views: 10746

Infatica and Helium Scraper Integration

Instructions on how to integrate Infatica proxies into Helium Scraper have been published here.
by webmaster
Mon Dec 02, 2019 5:22 pm
Forum: Q/A
Topic: SelectBy.Script
Replies: 3
Views: 2659

Re: Select.ByScript

Yes, Select.ByScript must return a DOM element or a list of DOM elements. I guess the documentation should be explicit about this. If you just need to get a string from a script, you can use Browser.EvalScript like this: Browser.EvalScript · "return 'Hello World!';" · 0 as myString extract test mySt...