
Data journalist? Here’s how to deal with the changes to ScraperWiki

Scraping is an important tool for data journalists. Sometimes you are lucky and can simply download your data, or copy and paste it from a website. If not, you have to reach for heavier tools: a wrench like OutWit Hub may do the job. And if that fails too, there is one last resort: the crowbar that is ScraperWiki, where you can code your own scraper. Paul Bradshaw paid much attention to ScraperWiki in his book Scraping for Journalists (check out the Memeburn review).

ScraperWiki has recently been updated, and not just the look and feel of the website. Luckily you can still use the recipes Bradshaw put together, but there are a few other things you might need to know.

To use the new ScraperWiki, you have to create a new account: your old login and password no longer work. Your scrapers and data are not carried over to the new site automatically, either. You can still find them on the old website, where you can log in with your existing ID and password. A script is available for exporting your work from the old site to the new one, though copying and pasting also works.
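If you prefer to move your data by hand rather than via the migration script, it helps to know that a Classic scraper's datastore is an SQLite database. Once you have a local copy of it, dumping a table to CSV for re-upload takes a few lines of Python. This is a sketch, not ScraperWiki's own tooling; the table name `swdata` (Classic's default) and the file paths are assumptions about your own scraper:

```python
import csv
import sqlite3

def export_table_to_csv(db_path, table, csv_path):
    """Dump one table from a local copy of a scraper's SQLite
    datastore to a CSV file, header row included."""
    conn = sqlite3.connect(db_path)
    # Note: the table name is trusted here; fine for your own data,
    # not for untrusted input.
    cur = conn.execute(f"SELECT * FROM {table}")
    headers = [col[0] for col in cur.description]
    with open(csv_path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(headers)
        writer.writerows(cur)
    conn.close()

# export_table_to_csv("myscraper.sqlite", "swdata", "myscraper.csv")
```

The resulting CSV can then be uploaded to the new site with its spreadsheet-upload tile.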

Community

The new ScraperWiki service has several limitations and now comes with a price tag too:

  • Community, the free version, is limited to three scrapers and/or datasets, none bigger than 8 MB, using no more than 30 minutes of CPU time;
  • Data Scientist, the second option, gives you for US$29 a month an unlimited number of scrapers/datasets of up to 256 MB each, again using no more than 30 minutes of CPU time;
  • Explorer, the third and last option, gives you 10 datasets for US$9 a month.

When I tried to scrape a new dataset, already having three sets in my account, ScraperWiki immediately served me with a screen demanding I upgrade.

“More powerful for the end-user and more flexible for the coder”: this is the new motto of ScraperWiki, and it becomes clear as soon as you want to scrape a new dataset. The old menus have been replaced by tiles. ‘Code in your browser’ brings you back to the familiar environment for creating a scraper in various languages (Python, Ruby and PHP are still available, and new ones have been added).
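For readers new to the idea, a scraper is just a short script that fetches a page, picks out the values you want and stores them in a table. On the site itself you would use ScraperWiki's own helper library; the stripped-down sketch below uses only Python's standard library, and the `HeadlineParser` class, the `swdata` table name and the `save` helper are illustrative names of my own, not ScraperWiki's actual API:

```python
import sqlite3
from html.parser import HTMLParser

class HeadlineParser(HTMLParser):
    """Collect the text of every <h2> element on a page."""
    def __init__(self):
        super().__init__()
        self.in_h2 = False
        self.headlines = []

    def handle_starttag(self, tag, attrs):
        if tag == "h2":
            self.in_h2 = True

    def handle_endtag(self, tag):
        if tag == "h2":
            self.in_h2 = False

    def handle_data(self, data):
        if self.in_h2 and data.strip():
            self.headlines.append(data.strip())

def scrape(html):
    """Return the list of <h2> headlines found in an HTML string."""
    parser = HeadlineParser()
    parser.feed(html)
    return parser.headlines

def save(rows, db_path):
    """Store scraped rows in an SQLite table, as ScraperWiki does."""
    conn = sqlite3.connect(db_path)
    conn.execute("CREATE TABLE IF NOT EXISTS swdata (headline TEXT)")
    conn.executemany("INSERT INTO swdata VALUES (?)", [(r,) for r in rows])
    conn.commit()
    conn.close()
```

In a real scraper the HTML string would come from fetching a live URL rather than being passed in directly.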

Maps and Graphs

Once you have a scraper working, there are now several new possibilities when it comes time to work with your data.

Again we can choose options from different tiles:

  • View your data in a table format;
  • Create a graph or map from the dataset, or query it using SQL;
  • Download your data.

These options are new and are much easier and faster to work with than the old interface, where you had to create a separate view in order to inspect and/or download your dataset.
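Because each dataset is stored as an SQLite table, the SQL-query tile behaves much like querying SQLite directly, which you can try out locally with Python's stdlib. The table name `swdata` and the city figures below are made-up sample data for illustration:

```python
import sqlite3

# Build a small in-memory table standing in for a scraped dataset
# (the population figures are made-up illustrations).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE swdata (city TEXT, population INTEGER)")
conn.executemany(
    "INSERT INTO swdata VALUES (?, ?)",
    [("Utrecht", 361924), ("Amsterdam", 921402), ("Rotterdam", 664311)],
)

# The kind of query you might type into the SQL tile:
query = (
    "SELECT city, population FROM swdata "
    "WHERE population > 500000 ORDER BY population DESC"
)
results = list(conn.execute(query))
for city, population in results:
    print(city, population)
```

Filtering, sorting and aggregating this way is often quicker than downloading the data and opening it in a spreadsheet.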

New in the main menu are tiles for searching for tweets and for searching Flickr using geotags. Uploading a spreadsheet, querying it with SQL or creating a graph or map from the data also works smoothly. Coders have another choice: they can create their own tools and log in directly to the ScraperWiki server using SSH.

But where is the old option to look into other users’ scrapers, fork them and modify them for your own purposes? “Unlike Classic, the new ScraperWiki is not aiming to be a place where people publicly share code and data. The new ScraperWiki is, at its heart, a more private, personal service.”

That is bad luck, because studying working scrapers is not only helpful but also instructive. However, says ScraperWiki, you can publish your scrapers on GitHub, or share your data at DataHub.io.

That is cold comfort, and in the meantime (probably until September) I’ll stick with the old ScraperWiki.

Author | Peter Verweij

After 30 years of lecturing and training in journalism, politics and new media at the School of Journalism in Utrecht, Peter Verweij started his own company, D3-Media, in 2005. It focuses on the production of journalistic content for multimedia and blogs, and research in the area...