Member-only story

Introduction to web scraping and real world task

Donald Le
4 min readDec 9, 2019

--

Recently, I’ve been assigned a task to check whether the list of page news is too old or not.
That comes the part I need to go over the page source to get the date information.

I’ve heard about webscraping before so for this task I applied web scraping technique.
Here is what I learnt.

I.What is webscraping

If you’ve ever copy and pasted information from a website, you’ve performed the same function as any web scraper, only on a microscopic, manual scale.

Web scraping, also known as web data extraction, is the process of retrieving or “scraping” data from a wehttps://dev.to/cuongld2/introduction-to-web-scraping-and-real-world-task-2pkjbsite. Unlike the mundane, mind-numbing process of manually extracting data, web scraping uses intelligent automation to retrieve hundreds, millions, or even billions of data points from the internet’s seemingly endless frontier.

More than a modern convenience, the true power of web scraping lies in its ability to build and power some of the world’s most revolutionary business applications. ‘Transformative’ doesn’t even begin to describe the way some companies use web scraped data to enhance their operations, informing executive decisions all the way down to individual customer service experiences.

II.What we’ll need
We will apply the webscraping in Python so we need to check out information about below libraries.

--

--

Donald Le
Donald Le

Written by Donald Le

A passionate automation engineer who strongly believes in “A man can do anything he wants if he puts in the work”.

No responses yet