Google Extension for Ad Hoc Building / Website Cleaning

A number of steps can help with the development of an automatic website scraper. The first is reducing the noise on the website. The second is automatic XPath extraction, where an XPath defines the path to a given element within the page.
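To make the XPath idea concrete, here is a minimal sketch of deriving a positional XPath for one element. It is an illustration rather than the extension's actual code: it uses Python's standard-library ElementTree on a small, well-formed sample page, whereas a browser extension would walk the live DOM. The names `build_parent_map`, `xpath_for` and the sample `PAGE` are all hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample page; real HTML is rarely well-formed XML like this.
PAGE = """<html><body>
<div><h1>Widget</h1></div>
<div><span>Blue</span><span>9.99</span></div>
</body></html>"""


def build_parent_map(root):
    """Map every element to its parent (ElementTree keeps no parent links)."""
    return {child: parent for parent in root.iter() for child in parent}


def xpath_for(element, root, parent_map):
    """Build an absolute, positional XPath from the root down to element."""
    steps = []
    node = element
    while node is not root:
        parent = parent_map[node]
        # XPath positions are 1-based and count only siblings with the same tag.
        same_tag = [c for c in parent if c.tag == node.tag]
        steps.append(f"{node.tag}[{same_tag.index(node) + 1}]")
        node = parent
    steps.append(root.tag)
    return "/" + "/".join(reversed(steps))


root = ET.fromstring(PAGE)
parent_map = build_parent_map(root)
price = root.findall(".//span")[1]  # the element we want a path for
path = xpath_for(price, root, parent_map)
print(path)  # /html/body[1]/div[2]/span[2]
```

Positional paths like this are simple to compute but brittle if the page layout changes; paths anchored on ids or class names are a common alternative.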

I was tasked with developing a Google Extension that automatically removes the noise and calculates the XPaths for all the required fields on the website. These XPaths are then passed to a separate system, which starts the automatic scraping.
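As a rough illustration of the noise-removal step, the sketch below strips elements that usually carry no scrapeable content. It is not the extension's implementation: the `NOISE_TAGS` set, the `strip_noise` helper and the sample page are assumptions, and Python's standard-library ElementTree stands in for the browser DOM an extension would actually work on.

```python
import xml.etree.ElementTree as ET

# Assumed definition of "noise"; a real cleaner would likely use richer rules.
NOISE_TAGS = {"script", "style", "nav", "footer", "aside"}

# Hypothetical sample page, kept well-formed so ElementTree can parse it.
NOISY_PAGE = (
    "<html><body>"
    "<nav><a>Home</a></nav>"
    "<div><h1>Widget</h1><script>track()</script></div>"
    "<footer>Copyright</footer>"
    "</body></html>"
)


def strip_noise(element, noise_tags=NOISE_TAGS):
    """Remove noisy descendants in place and return how many were detached."""
    removed = 0
    # Iterate over a copy, because we mutate the child list while walking it.
    for child in list(element):
        if child.tag in noise_tags:
            element.remove(child)  # drops the whole subtree under child
            removed += 1
        else:
            removed += strip_noise(child, noise_tags)
    return removed


page = ET.fromstring(NOISY_PAGE)
removed_count = strip_noise(page)
cleaned = ET.tostring(page, encoding="unicode")
print(removed_count)  # 3
print(cleaned)        # <html><body><div><h1>Widget</h1></div></body></html>
```

Note that removing an element this way also discards any text that directly follows it inside the parent, so a production cleaner would need to handle that case.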

Like what you see? Fancy a chat?

Email me Phone me

Testimonials

Dean is a grafter: a self-taught, hard-working and proactive developer. I have followed his career development since graduation and would have no hesitation in recommending him. His online portfolio is an indication of his capabilities and evidence of his tenacity.

Dan Livingstone - Associate Professor - Interactive Systems, Plymouth University