Google Extension for Ad Hoc Building / Website Cleaning

To help with the development of  an automatic website scraper, a number of steps can be taken. One of these steps is reducing the noise on the website. Secondly to create an automatic xPath extraction which basically defines the path to a certain element within the website.

I was tasked with developing a Google Extension that automatically removes the noise whilst also automatically calculating the xPaths for all the required fields from within the website. These fields are then passed to a system which then starts the automatic scraping.

Like what you see then, fancy a chat?

Email me Phone me

Testimonials

I have used Dean Wronowski’s web services for about 7 years. Dean designed and maintains two websites for me. Both sites were perfect from planning to launch and still function well. I was particularly impressed with his ability to design sites, all I did was supply the text and photographs. Dean did the rest. Dean has always responded promptly to occasional need for tweeks and kept everything running smoothly. I have no hesitation in recommending him to small or large organisations. Whatever your requirements, commercial, charity, informative or social, in my experience, Dean’s design, layout and hosting are excellent and hassle free

Luigi Ciapparelli - Trestone Dental