Google Extension for Ad Hoc Building / Website Cleaning

To help with the development of  an automatic website scraper, a number of steps can be taken. One of these steps is reducing the noise on the website. Secondly to create an automatic xPath extraction which basically defines the path to a certain element within the website.

I was tasked with developing a Google Extension that automatically removes the noise whilst also automatically calculating the xPaths for all the required fields from within the website. These fields are then passed to a system which then starts the automatic scraping.

Like what you see then, fancy a chat?

Email me Phone me

Testimonials

We have been working with Dean for several years and we could not be happier with his service and help. Being a small business without much time to sort out our website needs we really appreciate dealing with someone who has taken the time to understand what we do and how this relates to the crazy ideas we sometimes have regarding our website and how it it used. Dean has given good advice and when things need sorting out he is very prompt and the work gets done very quickly. We recommend Dean’s services to everyone.

Simon Hammond - Shoreline Extreme Sports