Side Project: Prober – AI Visual Website Scraper

With scraping data from the internet becoming more difficult and with new technologies being developed at a faster rate than ever before, I decided to build a new system (for personal use) called Prober, which designed to overcome the difficulties of scraping.

Prober scrapes websites (if robots.txt allow it to do so) without code or the use of xPaths. With websites constantly changing results in xPaths requiring changes, despite the fact that often, visually, nothing too drastic changes. Prober can predict visually where certain elements are placed on a page so that you don’t need need to update the xPaths. The system is trained, via visual annotation, on what an element looks like within a webpage; so doesn’t matter if a site’s layout slightly changes. Also It does not interact with the DOM at all which is what most scrapers rely on at the moment.

Prober is still in development but I can see the benefits and I’m loving it!

Like what you see then, fancy a chat?

Email me Phone me

Testimonials

I have used Dean Wronowski’s web services for about 7 years. Dean designed and maintains two websites for me. Both sites were perfect from planning to launch and still function well. I was particularly impressed with his ability to design sites, all I did was supply the text and photographs. Dean did the rest. Dean has always responded promptly to occasional need for tweeks and kept everything running smoothly. I have no hesitation in recommending him to small or large organisations. Whatever your requirements, commercial, charity, informative or social, in my experience, Dean’s design, layout and hosting are excellent and hassle free

Luigi Ciapparelli - Trestone Dental