In this VMX screencast, witness the creation of a visual barcode detection program in under 9 minutes. You can see the entire training procedure -- creating an initial data set of labeled barcodes, improving the detector via a 5 minute interactive learning step, and finishing off with a qualitative evaluation of the trained barcode detector.
The inspiration came after reading Dr. Rosebrock's blog post on detecting barcodes using OpenCV and Python (
http://www.pyimagesearch.com/2014/11/24/detecting-barcodes-images-python-opencv/). While the code presented in Rosebrock's blog post is quite simple, it is most definitely domain-specific. Different domain-specific programs must be constructed for different objects. In other words, different kinds of morphological operations, features, and thresholds must be used for detecting different objects and it is not even clear how you would construct the rules to detect a complex object such as a "monkey." If you are just getting started with programming and want to learn how to construct some of these domain-specific programs, you're just going to have to subscribe to
http://www.pyimagesearch.com/.
Writing these kinds of vision programs is hard. Unless... you address the problem with some advanced machine learning techniques. Applying machine learning to visual problems is "the backbone" of what we do at vision.ai and computer vision research has been a personal passion of mine for over a decade. So I decided to take our most recent piece of vision tech for a spin. We try not to code while on vacation (a good team needs good rest), and I don't consider using our GUI-based VMX software as hardcore as "coding."
Unlike traditional vision systems whose operation might leave you with an engineering-hangover, using VMX is more akin to playing Minecraft. I figured that playing a video game or two on vacation is permissible.
Eliminating the residual sunscreen from my hands, I rebooted my soul with an iced gulp of Spice Isle Coffee and fired up my trusty Macbook Pro. I then grabbed the first few vacation-themed objects from the kitchen. (
And yes, I'm on vacation for Thanksgiving -- the objects include canned fruit, sunscreen, and a bottle of booze.) Then it was time to throw the barcode detection problem at VMX.
|
Step 1: Barcode Initial Selections |
30 seconds worth of initial clicks followed by several minutes worth of waving objects in front of the webcam is not hard work. 5 minutes later we have a sexy barcode detector. Not too bad for computer vision in a non-laboratory setting. While on vacation, I don't have access to a lab and neither should you. A sun-filled patio will have to suffice. In fact, it was so bright outside that I had to wear sunglasses the entire time. (Towards the end of the video, a "sunglasses" detector makes a cameo.)
Please note that he barcode is not actually "read" (so this program can't tell whether the region corresponds to canned pineapples or sunscreen), the region of interest is simply detected and tracked in real-time.
|
Final Step: Tweaking Learned Positives and Negatives |
This video is
an example of a pure machine-learning based approach to barcode detection. The underlying algorithm can be used to learn just about any visual concept you're interested in detecting. A bar code is just like a face or a car -- it is a 2D pattern which can be recognized by machines. Throughout my career I've trained thousands of detectors (mostly in an academic setting). VMX is the most fun with object recognition I've ever had and it lets me train detectors without having to worry about the mathematical details. Once you get your own copy of VMX, what will
you train?
To learn how to get your hands on VMX, sign up on the mailing list at
http://vision.ai or if you're daring enough, you can purchase an early beta license key from
https://beta.vision.ai.
So what's next? Should I build
a boat detector? Maybe I should train a detector to
let me know when I run low on Spice Isle Coffee? Or how about going on a field trip and
counting bikinis on the beach?
If you carefully look at the detections, you'll see that sometimes a subset of the barcode is detected as the barcode. A barcode is a special kind of object where this happens quite often. It's like a "sky" detector -- a piece of a sky is still a sky. Clearly the detection engine is designed for more rigid objects like faces, bicycles, hands, and rectangularish objects, but given the lighting conditions it's not too bad.
ReplyDeleteI enjoyed reading your posts. it very useful.
ReplyDeleteSelf Storage Seal Beach
Thanks for the usefull information, it will help a lot.
ReplyDeleteApartments for Rent in Houston
VMX looks fantastic, great work and congrats to you and the vision.ai team. I can't wait to take it for a spin!
ReplyDeletePretty element of content. I simply stumbled upon your blog and in accession capital to claim that I get actually loved account your weblog posts. Any way I’ll be subscribing on your feeds or even I success you access persistently rapidly.
ReplyDeletedetect barcode in .net