The Math and Code Behind…
Facial alignment is a prereq…
3 years, 4 months ago
by Sabbir Ahmed
Face detection is the step 1 process for various vision projects. Face detection, as the name suggests, is identifying human faces in a digital image. Detecting face is an essential step in many computer vision applications like face analysis, face tracking, face recognition.
Haar Feature Based Cascades Classifier
It is a machine learning based approach in which the classifier is trained with a lot of positive images (with face) and a lot of negative images (without face). To extract haar features the classifier uses 3 images.
Now, we need to find the sum of all the pixel intensities in the black and white region which is a very process hungry. So we use Integral Image (preprocessed image) which trims down the operation in constant time. Enough jargon, for now, a piece of very important information to remember is OpenCV's classifier was trained with 24x24.
So The smallest detected face from any image would be at least 24x24.
Opencv Haar Cascade Detection
Opencv has many pretrained classifiers, ie. face, eye, full/half body, smile, etc. We will learn to user frontal face classifier others will follow the same direction. Let's begin -
Step 1: Defining the detector/classifier
import cv2
# loading the haar cascade classifier from XML file
clf = cv2.CascadeClassifier('./data/haarcascade_frontalface_default.xml')
Step 2: Loading image and converting the image into grayscale
well, this is pretty obvious, right? We won't be needing any RGB or BGR (OpenCV format) as the classifier is trained on black and white features.
# loading image
img = cv2.imread('
./data/blog_post_face_detection.jpg')
print(img.shape) # image shape (height x width x number of channel)
gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
Step 3: Detecting and marking the Region of Interest (RIO)
Detecting is easy, we will use OpenCV's detectMultiScale
method on the classifier with some parameters. some useful parameters can be -
Now, we have to specify and draw the ROI. the detectMultiScale
returns list of detected faces with their initial co-ordinate(x, y) and width, height. With this information, we can easily finalize our solution -
# detect and draw rectangle
faces = clf.detectMultiScale(gray, 1.3, 5, (30, 30))
print(f"total face detected: {len(faces)}")
for (x,y,w,h) in faces:
img = cv2.rectangle(img, (x, y),(x + w, y + h), (255, 0, 0), 2)
# displaying the image and cleaning up
cv2.imshow('img',img)
cv2.waitKey(0)
cv2.destroyAllWindows()
Output:
Detected 29 faces with 1 false positive.
Face Detection with Dlib HOG(Histogram of Oriented Gradients)
The idea of HOG descriptor is to create a vector of features so that the vector can be feed into a classification algorithm like SVM to predict the result. to calculate the HOG descriptor we need to calculate the gradients of x-axis and y-axis using Sobel kernel.
The code for Dlib face detector is pretty similar so I am going to write the code in one go...
import cv2
import dlib
# loading the haar cascade classifier from XML file
detector = dlib.get_frontal_face_detector()
# loading image
img = cv2.imread('./data/blog_post_face_detection.jpg')
faces = detector(img, 1)
for face in faces:
x1, y1 = face.tl_corner().x, face.tl_corner().y
x2, y2 = face.br_corner().x, face.br_corner().y
cv2.rectangle(img, (x1, y1), (x2, y2), (0, 255, 0), 1)
# displaying the image and cleaning up
cv2.imshow(f'face detected: {len(faces)}',img)
cv2.waitKey(0)
cv2.destroyAllWindows()
Output:
Detected 28 faces with 0 false positive.