A Brief Introduction To Computer Vision
A Brief Introduction To Computer Vision
In the past few decades we’ve made numerous strides to extend visual ability to
machines. We’ve been successful in imitating how the human eye captures light and
colour. For example, cameras. But, capturing images is easy. Understanding what’s
there in the image is a tedious job.
Our brain can look at it and immediately know what it is (it’s the ACM logo). Our brains
are cheating since we’ve got a lot of evolutionary context to help us immediately
understand what this is. But a computer cannot do that. An algorithm sees the above
image like this:
Just a huge set of numbers which represent intensities of colours. There’s no context
here, just a huge pile of data. The context is the crux of getting algorithms to understand
image content in the same way as the human brain does.
To achieve the same, we use an algorithm which works very similar to how the human
brain functions using “Machine Learning”. Machine learning allows us to train the context
for a data set so that an algorithm can understand what all those integer values in that
specific organisation actually represent. This is just an outline of how computer vision
works. The actual algorithm is way more complex.