Custom-Built Google Chrome Extension for Manual Scraping

Developing automated website scrapers can be challenging because of how modern websites are built. Common obstacles include frequent layout changes, CAPTCHAs, bot-detection patterns, pagination, the need to stay anonymous, logins, and pages that load new content as the user keeps scrolling.

I was tasked with developing a Google Chrome extension to speed up manual scraping. The biggest benefit of an extension is that the user loads a website exactly as if they were browsing it normally, which reduces the challenges that automated scraping faces.

The extension lets the user click anywhere on the page and have the text there extracted automatically. The text is then passed to a system where it is translated, cleaned and mapped to specific fields. A summary showing the scraping progress is also visible.
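As a rough illustration of the click-to-extract flow described above, here is a minimal TypeScript sketch of what a content script might do. It is not the extension's actual code: the field names, the `ScrapeState` shape and the helpers `cleanText` and `recordClick` are all hypothetical, the translation step is left out, and the real system that receives the text is not shown.

```typescript
// Hypothetical field names; the original project does not list its fields.
type FieldName = "title" | "price" | "description";

interface ScrapeState {
  order: FieldName[];                          // fields to fill, in click order
  record: Partial<Record<FieldName, string>>;  // values captured so far
}

// Collapse runs of whitespace (including newlines) and trim the ends.
function cleanText(raw: string): string {
  return raw.replace(/\s+/g, " ").trim();
}

// Store the cleaned text in the next empty field and return a progress summary.
// Empty clicks, and clicks after every field is filled, change nothing.
function recordClick(state: ScrapeState, raw: string): string {
  const text = cleanText(raw);
  const field = state.order.find((f) => state.record[f] === undefined);
  if (field !== undefined && text !== "") {
    state.record[field] = text;
  }
  const done = state.order.filter((f) => state.record[f] !== undefined).length;
  return `${done}/${state.order.length} fields captured`;
}

// Browser-only wiring. `document` is looked up via globalThis so that the
// file can also be loaded outside a browser without failing.
const page = (globalThis as any).document;
if (page) {
  const state: ScrapeState = {
    order: ["title", "price", "description"],
    record: {},
  };
  page.addEventListener(
    "click",
    (event: any) => {
      const text = event.target?.innerText;
      if (typeof text === "string") {
        console.log(recordClick(state, text));
      }
    },
    true // capture phase, so the click is seen before the page's own handlers
  );
}
```

Keeping the cleaning and mapping in plain functions, separate from the click listener, means they can be exercised without a browser. In a real extension the captured record would presumably be sent on to the receiving system rather than logged.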


Testimonials

Dean Wronowski has helped to plan our club website, advising us about user-friendly layout and designing excellent images and colour schemes that reflect the functioning and ethos of our club. Whenever we have needed an update task to be completed, Dean has responded immediately and helped us to keep the website running smoothly all the time. He communicates extremely well and is always ready to help with whatever we need for our website and email accounts. I thoroughly recommend using Dean for your web communication needs.

Bude Surf Life Saving Club