Beyond Dlib: 81 Landmarks Cover the Whole Face for More Precise Facial Landmark Detection (Code Included)

2020-12-01, Hexun Tech

Original article by Xinzhiyuan

Sources: Reddit, GitHub

Editor: Jin Lei

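[Xinzhiyuan Digest] Facial landmark detection is an important step in the face analysis pipeline. The usual tools are OpenCV and Dlib. Dlib is widely regarded as the better of the two, but the 68 points it detects do not cover the forehead. Building on Dlib, a Reddit user trained a model that detects 81 facial landmarks, which makes head detection and image manipulation more precise.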

This may be the facial landmark detector you have been looking for.

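Facial landmark detection is an important step in face analysis. It runs on top of face detection: once a face has been found, it locates feature points on that face, such as the corners of the mouth and the corners of the eyes.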

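Recently, a Reddit user posted a thread to share some of his work with the community:

The main contribution is to add 13 landmarks to the 68 provided by Dlib's facial landmark model (81 in total), which makes head detection and image manipulation more precise.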

Here is the demo:

Demo video link:

https://www.youtube.com/watch?v=mDJrASIB1T0

81 landmarks for more precise facial landmark detection

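In the past, facial landmark detection was typically done with OpenCV.

Since Dlib appeared, though, many users have come to prefer it, with some summing it up as "Dlib ≥ OpenCV". Dlib ships more face-related models, and its standard shape predictor locates 68 landmarks on the face. The same trainer can also be used for larger landmark sets.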

Here is what Dlib's output looks like:

Figure: Dlib facial landmark detection result

So how are these 68 landmarks distributed? See the face chart below:

Figure: Distribution of the 68 facial landmarks

In both the result image and the face chart, however, no landmarks fall on the forehead.
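
The layout in the chart can also be written down as index ranges. The sketch below uses the grouping that is commonly quoted for the 68-point scheme (0-based, end-exclusive). The dictionary name, region labels, and helper function are illustrative and are not part of Dlib's API.

```python
# Index ranges (start inclusive, end exclusive) commonly quoted for the
# 68-point layout. The names here are illustrative, not a Dlib API.
LANDMARK_REGIONS_68 = {
    "jaw": (0, 17),
    "right_eyebrow": (17, 22),
    "left_eyebrow": (22, 27),
    "nose": (27, 36),
    "right_eye": (36, 42),
    "left_eye": (42, 48),
    "mouth": (48, 68),
}


def region_of(index):
    """Return the region name for a landmark index, or None if uncovered."""
    for name, (start, end) in LANDMARK_REGIONS_68.items():
        if start <= index < end:
            return name
    return None


# Total number of indices covered by the named regions.
covered = sum(end - start for start, end in LANDMARK_REGIONS_68.values())
print(covered)        # 68
print(region_of(30))  # nose
print(region_of(68))  # None
```

None of the regions corresponds to the forehead, which matches the gap visible in the chart.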

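So the Reddit user proposed a model whose landmarks also cover the forehead.

It is a custom shape predictor model that, once trained, finds 81 facial landmarks in a given image.

It is trained in much the same way as Dlib's 68-point facial landmark shape predictor, except that 13 points are added in the forehead region on top of the original 68. This improves head detection, as well as image operations that need points along the top of the head.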
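
To see why points along the top of the head matter for image operations, consider building a face mask from the convex hull of the landmarks. The sketch below is pure Python and uses made-up coordinates, not model output: with only jaw and brow points the hull stops at the eyebrows, while adding a few hypothetical forehead points extends it upward. All function and variable names are illustrative.

```python
def cross(o, a, b):
    """2D cross product of vectors OA and OB."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points):
    """Convex hull via Andrew's monotone chain; returns hull vertices."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def polygon_area(poly):
    """Area of a simple polygon using the shoelace formula."""
    total = 0.0
    for i in range(len(poly)):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % len(poly)]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2.0


# Made-up image coordinates (y grows downward), for illustration only.
jaw_and_brows = [(20, 60), (30, 100), (60, 120), (90, 100),
                 (100, 60), (35, 50), (85, 50)]
forehead = [(30, 25), (60, 10), (90, 25)]

hull_68 = convex_hull(jaw_and_brows)
hull_81 = convex_hull(jaw_and_brows + forehead)

# The smallest y value is the highest point the mask reaches.
top_68 = min(y for _, y in hull_68)
top_81 = min(y for _, y in hull_81)
print(top_68, top_81)  # 50 10
```

With the forehead points included, the hull reaches higher in the image and encloses a larger area, which is what a mask for hats, hair, or full-head warping would need.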

Figure: Result with 81 landmarks

The method for extracting the 13 extra landmarks builds on the author's earlier work.

GitHub repository:

https://github.com/codeniko/eos

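The author continued to use the Surrey Face Model, noted down the 13 points he considered suitable for his purposes, and made some modifications to the details.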

The author also generously shared the training code, which follows Dlib's train_shape_predictor.py example program:

#!/usr/bin/python
# The contents of this file are in the public domain. See LICENSE_FOR_EXAMPLE_PROGRAMS.txt
#
# This example program shows how to use dlib's implementation of the paper:
# One Millisecond Face Alignment with an Ensemble of Regression Trees by
# Vahid Kazemi and Josephine Sullivan, CVPR 2014
#
# In particular, we will train a face landmarking model based on a small
# dataset and then evaluate it. If you want to visualize the output of the
# trained model on some images then you can run the
# face_landmark_detection.py example program with predictor.dat as the input
# model.
#
# It should also be noted that this kind of model, while often used for face
# landmarking, is quite general and can be used for a variety of shape
# prediction tasks. But here we demonstrate it only on a simple face
# landmarking task.
#
# COMPILING/INSTALLING THE DLIB PYTHON INTERFACE
# You can install dlib using the command:
# pip install dlib
#
# Alternatively, if you want to compile dlib yourself then go into the dlib
# root folder and run:
# python setup.py install
#
# Compiling dlib should work on any operating system so long as you have
# CMake installed. On Ubuntu, this can be done easily by running the
# command:
# sudo apt-get install cmake
#
# Also note that this example requires Numpy which can be installed
# via the command:
# pip install numpy

import os
import sys
import glob

import dlib

# In this example we are going to train a face detector based on the small
# faces dataset in the examples/faces directory. This means you need to supply
# the path to this faces folder as a command line argument so we will know
# where it is.
if len(sys.argv) != 2:
    print(
        "Give the path to the examples/faces directory as the argument to this "
        "program. For example, if you are in the python_examples folder then "
        "execute this program by running:\n"
        " ./train_shape_predictor.py ../examples/faces")
    exit()
faces_folder = sys.argv[1]

options = dlib.shape_predictor_training_options()
# Now make the object responsible for training the model.
# This algorithm has a bunch of parameters you can mess with. The
# documentation for the shape_predictor_trainer explains all of them.
# You should also read Kazemi's paper which explains all the parameters
# in great detail. However, here I'm just setting three of them
# differently than their default values. I'm doing this because we
# have a very small dataset. In particular, setting the oversampling
# to a high amount (300) effectively boosts the training set size, so
# that helps this example.
options.oversampling_amount = 300
# I'm also reducing the capacity of the model by explicitly increasing
# the regularization (making nu smaller) and by using trees with
# smaller depths.
options.nu = 0.05
options.tree_depth = 2
options.be_verbose = True

# dlib.train_shape_predictor() does the actual training. It will save the
# final predictor to predictor.dat. The input is an XML file that lists the
# images in the training dataset and also contains the positions of the face
# parts.
training_xml_path = os.path.join(faces_folder, "training_with_face_landmarks.xml")
dlib.train_shape_predictor(training_xml_path, "predictor.dat", options)

# Now that we have a model we can test it. dlib.test_shape_predictor()
# measures the average distance between a face landmark output by the
# shape_predictor and where it should be according to the truth data.
print("\nTraining accuracy: {}".format(
    dlib.test_shape_predictor(training_xml_path, "predictor.dat")))
# The real test is to see how well it does on data it wasn't trained on. We
# trained it on a very small dataset so the accuracy is not extremely high, but
# it's still doing quite good. Moreover, if you train it on one of the large
# face landmarking datasets you will obtain state-of-the-art results, as shown
# in the Kazemi paper.
testing_xml_path = os.path.join(faces_folder, "testing_with_face_landmarks.xml")
print("Testing accuracy: {}".format(
    dlib.test_shape_predictor(testing_xml_path, "predictor.dat")))

# Now let's use it as you would in a normal application. First we will load it
# from disk. We also need to load a face detector to provide the initial
# estimate of the facial location.
predictor = dlib.shape_predictor("predictor.dat")
detector = dlib.get_frontal_face_detector()

# Now let's run the detector and shape_predictor over the images in the faces
# folder and display the results.
print("Showing detections and predictions on the images in the faces folder...")
win = dlib.image_window()
for f in glob.glob(os.path.join(faces_folder, "*.jpg")):
    print("Processing file: {}".format(f))
    img = dlib.load_rgb_image(f)

    win.clear_overlay()
    win.set_image(img)

    # Ask the detector to find the bounding boxes of each face. The 1 in the
    # second argument indicates that we should upsample the image 1 time. This
    # will make everything bigger and allow us to detect more faces.
    dets = detector(img, 1)
    print("Number of faces detected: {}".format(len(dets)))
    for k, d in enumerate(dets):
        print("Detection {}: Left: {} Top: {} Right: {} Bottom: {}".format(
            k, d.left(), d.top(), d.right(), d.bottom()))
        # Get the landmarks/parts for the face in box d.
        shape = predictor(img, d)
        print("Part 0: {}, Part 1: {} ...".format(shape.part(0),
                                                  shape.part(1)))
        # Draw the face landmarks on the screen.
        win.add_overlay(shape)

    win.add_overlay(dets)
    dlib.hit_enter_to_continue()
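
The training script reads an XML file that lists each image, its face box, and the labeled parts. As a rough sketch of what such a file might look like with 81 landmarks per face, the snippet below builds one entry using only Python's standard library. The element and attribute names follow the imglab-style layout of Dlib's example datasets as far as I understand it, and the file name, box, and coordinates are placeholders, so compare against the training_with_face_landmarks.xml shipped with Dlib before relying on it.

```python
import xml.etree.ElementTree as ET


def build_training_xml(entries):
    """Build a dataset tree.

    entries: list of (image_file, (top, left, width, height), [(x, y), ...]).
    The layout is assumed to match Dlib's example landmark datasets.
    """
    dataset = ET.Element("dataset")
    images = ET.SubElement(dataset, "images")
    for image_file, (top, left, width, height), points in entries:
        image = ET.SubElement(images, "image", file=image_file)
        box = ET.SubElement(image, "box", top=str(top), left=str(left),
                            width=str(width), height=str(height))
        # One <part> per landmark, named by its zero-padded index.
        for i, (x, y) in enumerate(points):
            ET.SubElement(box, "part", name="{:02d}".format(i),
                          x=str(x), y=str(y))
    return ET.ElementTree(dataset)


# Placeholder data: 81 dummy points on a diagonal, not real annotations.
dummy_points = [(100 + i, 100 + i) for i in range(81)]
tree = build_training_xml(
    [("face_0001.jpg", (90, 90, 120, 120), dummy_points)])

xml_text = ET.tostring(tree.getroot(), encoding="unicode")
num_parts = len(tree.getroot().findall("./images/image/box/part"))
print(num_parts)  # 81
```

Calling tree.write with a file path would save the result to disk. Under this layout, the number of part entries per box is what determines how many landmarks the trained predictor outputs.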

If this is what you need, give the model a try!
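
A minimal sketch of trying the model, assuming Dlib is installed and the trained model file from the GitHub repository listed in the references has been downloaded locally. The model file name, the image path, and the helper names are placeholders. The sketch also assumes that the 13 forehead points are stored after the original 68. The Dlib calls are the same ones used in the training script above.

```python
def shape_to_points(shape):
    """Convert a Dlib-style shape (num_parts, part(i).x/.y) to (x, y) tuples."""
    return [(shape.part(i).x, shape.part(i).y)
            for i in range(shape.num_parts)]


def split_forehead(points, base_count=68):
    """Split points into the first `base_count` and the remainder.

    Assumes the extra forehead points follow the original 68.
    """
    return points[:base_count], points[base_count:]


def detect_landmarks(image_path, model_path):
    """Run face detection, then landmark prediction, on one image."""
    import dlib  # imported lazily so the helpers above work without Dlib

    detector = dlib.get_frontal_face_detector()
    predictor = dlib.shape_predictor(model_path)
    img = dlib.load_rgb_image(image_path)
    # Upsample once, as in the training script, to find smaller faces.
    return [shape_to_points(predictor(img, box)) for box in detector(img, 1)]


# Example call (both paths are placeholders):
# faces = detect_landmarks("photo.jpg", "shape_predictor_81_face_landmarks.dat")
# base, forehead = split_forehead(faces[0])
```

Keeping the conversion helpers separate from the Dlib calls makes it easy to feed the points into other code, for example to draw them or to build a mask.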

References:

GitHub:

https://github.com/codeniko/shape_predictor_81_face_landmarks

Reddit:

https://www.reddit.com/r/MachineLearning/comments/b20b9i/p_i_trained_a_face_predictor_that_detects_fulls/

YouTube:

https://www.youtube.com/watch?v=mDJrASIB1T0


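This article was first published on the WeChat official account Xinzhiyuan. The content reflects the author's personal views and does not represent the position of Hexun. Investors who act on it do so at their own risk.

(Editor in charge: He Yihua HN110)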
