Side Project: Prober – AI Visual Website Scraper

With scraping data from the internet becoming more difficult and with new technologies being developed at a faster rate than ever before, I decided to build a new system (for personal use) called Prober, which designed to overcome the difficulties of scraping.

Prober scrapes websites (if robots.txt allow it to do so) without code or the use of xPaths. With websites constantly changing results in xPaths requiring changes, despite the fact that often, visually, nothing too drastic changes. Prober can predict visually where certain elements are placed on a page so that you don’t need need to update the xPaths. The system is trained, via visual annotation, on what an element looks like within a webpage; so doesn’t matter if a site’s layout slightly changes. Also It does not interact with the DOM at all which is what most scrapers rely on at the moment.

Prober is still in development but I can see the benefits and I’m loving it!

Like what you see then, fancy a chat?

Email me Phone me

Testimonials

Dean Wronowski has single handedly rebranded Bude. Not only in demand for web and graphics across the town and used by many businesses, Dean is now working for some serious national and international campaigns due to his original designs and professional approach. If you are looking to build a great original brand, complete with website, graphics and all marketing materials then this man is the only man you need to speak to, saving you hours of liaising  between agencies and making it simple and fast. Whether you have a clear idea or what you want or need someone to guide you in all areas Dean is able to deliver on every level.
He is also a complete legend!
Beth - Beach House Widemouth Bay