🌸 Digital Garden

Workshop 3

← Back to Reflections

What I’ve done

In this workshop, we try to use wedscraper to collect data.

POINT 1

Webscraper is a plugin. After the web page selection check, we need to find the last option in the pop-up box.

Introduction to Arduino as a Creative Hacking Tool

POINT 3

More thoughts

In this Web Scraping workshop, I started to look at websites from the perspective of “a webpage is also data,” and using the developer tools to inspect HTML structure suddenly felt much clearer. When I tried using tools like OutWit Hub and WebScraper.io, I realised that scraping is more complicated than it looks. For example, some sites only load new content when you scroll down, but the tools couldn’t scroll automatically, so I could only capture whatever appeared on the first screen. The auto-generated selectors were also not always accurate, and I often ended up scraping empty fields or duplicated items. I later tried inspecting the code manually and adjusting the CSS selectors myself, which improved the accuracy; I also found that tools like Octoparse can simulate scrolling, making them more suitable for dynamic pages. Overall, this workshop made me realize that while tools are useful, the real key to successful scraping is understanding how webpages are structured and being willing to experiment when things don’t work. In the future, combining these tools with Python would likely help me scrape data more efficiently and accurately.

Reading references (Rogers, 2024)