Helium Scraper Blog

September 28, 2021

TypeScript + Helium Scraper

By Juan Soldi Miscellaneous 0 Comments

Since its inception, Helium Scraper has supported injecting and running JavaScript code. This makes it possible to perform complex calculations and access information that is not directly accessible to Helium Scraper, such as JSON data stored in elements and variables. But mapping between JavaScript and Helium Scraper types has always been a hassle. Since JavaScript

March 21, 2021

Low-Code Web Scraping

By Juan Soldi Miscellaneous 0 Comments

Developers like myself love dark-mode code editors, terminals and cryptic-looking command lines, but certain tasks, such as web development, data sheet calculations and, of course, web scraping, don’t require this level of technical expertise. Low-code is an approach to software development that allows users of any level of technical knowledge to participate in the

December 11, 2019

Scraping from LinkedIn

By Juan Soldi Helium Scraper Projects 0 Comments

We’ve created a ready-made template that can be used to extract people and company information from LinkedIn. An account is required for the extraction to work. Check with support to see how many profiles/companies you’re allowed to view per day; otherwise, your account could get banned. Getting started: To get started, download the template and

November 8, 2019

Proxies for web-scraping: all you need to know

By nathalie@infatica.io Miscellaneous 0 Comments

Anyone who tries to scrape some data from the web sooner or later faces the need to use proxies to bypass certain restrictions and make the process more streamlined and smooth. Moreover, if you’re dealing with large volumes of information, you absolutely must use a proxy. Otherwise, you simply won’t be able to reach your

October 29, 2019

The Plus Operator

By Juan Soldi Miscellaneous 0 Comments

There are many operators in Helium Scraper, but the plus (+) operator deserves its own tutorial, given the number of uses it has. This is because it doesn’t just represent addition, but also concatenation of strings and sequences. Simple Cases: Helium Scraper will treat the operator differently, depending on the type of data that is

July 26, 2019

Introducing Common Crawler

By Juan Soldi Miscellaneous 0 Comments

Common Crawler is a free version of Helium Scraper that, instead of loading pages from the web, loads them from the Common Crawl database. Aimed at both developers and non-developers, it makes it easy to query the Common Crawl data and then create selectors and actions that extract structured data from the target HTML

December 11, 2018

The Web Scraping Dilemma

By Juan Soldi Miscellaneous 0 Comments

The web scraping community seems to be divided into two sub-worlds. One is the world of programmers, who often use Python or JavaScript to carefully craft their agents down to the details in a time-consuming but ultimately rewarding process. The other is the world of layman users, who must choose between a

May 15, 2011

The often overlooked JavaScript Gatherers

By Juan Soldi Miscellaneous 1 Comment

Note: This post was written for an older version of Helium Scraper. Gatherers are the eyes of Helium Scraper. And JavaScript gatherers are its user-customized eyes. Let me give you a quick example. I had a user having trouble with a kind that was supposed to select a “next” button in a page. It worked

May 14, 2011

Programming Helium Scraper

By Juan Soldi Helium Scraper Projects, Miscellaneous 2 Comments

Note: This post was written for an older version of Helium Scraper. I’m assuming you already have a little JavaScript knowledge. If not, here is a quick JavaScript tutorial that covers all you need to know for the purpose of this tutorial. I’m also assuming you have some experience working with Helium Scraper. In Helium

May 14, 2011

Minimal JavaScript tutorial for non programmers

By Juan Soldi Miscellaneous 1 Comment

This is a quick JavaScript tutorial for total non-programmers. I won’t focus on JavaScript as applied to webpages, which is the case for most tutorials, because I’m mainly considering Helium Scraper users. So this tutorial comes in handy if you want to learn JavaScript without necessarily caring about how to design web pages. First off,