在Blender中使用代码控制人物模型的头部姿态 - 代码实践Dlib版本_blender 实现位姿估计

作者：笔触狂放9 | 2024-03-29 00:31:29

踩

blender 实现位姿估计

在Blender中使用代码控制人物模型的头部姿态 - 代码实践Dlib版本

flyfish

在文章的最后，贴上完整的代码
需要一个普通的摄像头，可以像虚拟UP主一样做头部动作

环境：
Ubuntu18.04
Blender 版本2.82
Dlib：用于人脸关键点检测
OpenCV版本3.4.16
solvePnP：从3D-2D点对应关系中找到一个目标的姿态，在这里用于通过人脸关键点计算出头部姿态
目标：输入二维的人脸关键点，输出三维的头部姿态
在这里插入图片描述
这里使用的dlib模型检测出68个关键点，在计算头部姿态时使用其中6个

关键点的索引分别是：代码中索引从0开始计算
下巴：8
鼻尖：30
左眼角：36
右眼角：45
左嘴角：48
右嘴角：54

3D头部模型（3D Head Model）
构建一个关键点的3D头部模型
在这里插入图片描述
所以会看到这样的代码

image_points = np.array([shape[30],     # Nose tip - 31
                            shape[8],      # Chin - 9
                            shape[36],     # Left eye left corner - 37
                            shape[45],     # Right eye right corne - 46
                            shape[48],     # Left Mouth corner - 49
                            shape[54]      # Right mouth corner - 55
                        ], dtype = np.float32)


# 3D model points.    
model_points = np.array([
                            (0.0, 0.0, 0.0),             # Nose tip
                            (0.0, -330.0, -65.0),        # Chin
                            (-225.0, 170.0, -135.0),     # Left eye left corner
                            (225.0, 170.0, -135.0),      # Right eye right corne
                            (-150.0, -150.0, -125.0),    # Left Mouth corner
                            (150.0, -150.0, -125.0)      # Right mouth corner
                        ], dtype = np.float32)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18

例如下巴：(0.0，-330.0，-65.0)这个叫世界坐标（World Coordinates），在OpenCV中叫模型坐标（Model Coordinates）

Blender中的目标存在一个世界中，摄像头拍摄的目标存在另一个世界中
在这里插入图片描述

在代码中，需要转换Blender和OpenCV和之间的坐标
在这里插入图片描述
通过对比两者的关系
X轴相同
Blender的Y轴是OpenCV的负Z轴
Blender的Z轴是OpenCV的Y轴
总结就是

Blender's red X axis = OpenCV 's X axis
Blender's green Y axis =  OpenCV's -Z axis
Blender's blue Z axis =  OpenCV's Y axis
1
2
3

所以会看到这样的代码

bones["head_fk"].rotation_euler[0] =  x
bones["head_fk"].rotation_euler[1] =  -z
bones["head_fk"].rotation_euler[2] =  y
1
2
3

在这里插入图片描述

solvePnP函数说明
输入
objectPoints - 世界坐标系下的关键点的坐标，即代码中的model_points
imagePoints - 在图像坐标系下对应的关键点的坐标。即代码中的image_points，与model_points的关键点顺序一致
cameraMatrix - 相机的内参矩阵
distCoeffs - 相机的畸变系数
输出
旋转向量(roatation vector)也就是上图中R
平移向量(translation vector)上图中的t

w：目标在世界坐标系下的坐标
c：目标在相机坐标系下的坐标
solvePnP求出w和c之间的映射关系

solvePnP函数返回结果包括旋转向量(roatation vector)和平移向量(translation vector)。这里只用了rotation_vector，赋值给bones[“head_fk”].rotation_euler
函数说明参考
https://docs.opencv.org/4.x/dc/d2c/tutorial_real_time_pose.html

solvePnP() 对异常点敏感,当相机拍摄目标时如果出现一些异常点，导致位姿估计的不准，可以尝试使用solvePnPRansac()
其他解决方案参考
https://docs.opencv.org/4.x/d9/d0c/group__calib3d.html#ga357634492a94efe8858d0ce1509da869
完整代码

import bpy
from imutils import face_utils
import dlib
import cv2
import time
import numpy as np

    
class DlibAnimOperator(bpy.types.Operator):
    """Operator which runs its self from a timer"""
    bl_idname = "wm.dlib_operator"
    bl_label = "Dlib Animation Operator"
    p = "/media/ubuntu/data/tool/blender-2.82-linux64/face/shape_predictor_68_face_landmarks.dat" 
    detector = dlib.get_frontal_face_detector()
    predictor = dlib.shape_predictor(p)
    # rig_name - take it from your scene collection tree
    rig_name = "RIG-Vincent"

    _timer = None
    _cap  = None
    
    width = 800
    height = 600

    stop :bpy.props.BoolProperty()
    
    # 3D model points.    
    model_points = np.array([
                                (0.0, 0.0, 0.0),             # Nose tip
                                (0.0, -330.0, -65.0),        # Chin
                                (-225.0, 170.0, -135.0),     # Left eye left corner
                                (225.0, 170.0, -135.0),      # Right eye right corne
                                (-150.0, -150.0, -125.0),    # Left Mouth corner
                                (150.0, -150.0, -125.0)      # Right mouth corner
                            ], dtype = np.float32)
    # Camera internals
    camera_matrix = np.array(
                            [[height, 0.0, width/2],
                            [0.0, height, height/2],
                            [0.0, 0.0, 1.0]], dtype = np.float32
                            )
    
    # Keeps a moving average of given length
    def smooth_value(self, name, length, value):
        if not hasattr(self, 'smooth'):
            self.smooth = {}
        if not name in self.smooth:
            self.smooth[name] = np.array([value])
        else:
            self.smooth[name] = np.insert(arr=self.smooth[name], obj=0, values=value)
            if self.smooth[name].size > length:
                self.smooth[name] = np.delete(self.smooth[name], self.smooth[name].size-1, 0)
        sum = 0
        for val in self.smooth[name]:
            sum += val
        return sum / self.smooth[name].size


    # Keeps min and max values, then returns the value in a ranve 0 - 1
    def get_range(self, name, value):
        if not hasattr(self, 'range'):
            self.range = {}
        if not name in self.range:
            self.range[name] = np.array([value, value])
        else:
            self.range[name] = np.array([min(value, self.range[name][0]), max(value, self.range[name][1])] )
        val_range = self.range[name][1] - self.range[name][0]
        if val_range != 0:
            return (value - self.range[name][0]) / val_range
        else:
            return 0

    def modal(self, context, event):

        if (event.type in {'RIGHTMOUSE', 'ESC'}) or self.stop == True:
            self.cancel(context)
            return {'CANCELLED'}

        if event.type == 'TIMER':
            self.init_camera()
            _, image = self._cap.read()
            gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
            rects = self.detector(gray, 0)
            # bpy.context.scene.frame_set(frame_num)
         
            # For each detected face, find the landmark.
            for (i, rect) in enumerate(rects):
                shape = self.predictor(gray, rect)
                shape = face_utils.shape_to_np(shape)
             
                #2D image points. If you change the image, you need to change vector
                image_points = np.array([shape[30],     # Nose tip - 31
                                            shape[8],      # Chin - 9
                                            shape[36],     # Left eye left corner - 37
                                            shape[45],     # Right eye right corne - 46
                                            shape[48],     # Left Mouth corner - 49
                                            shape[54]      # Right mouth corner - 55
                                        ], dtype = np.float32)
             
                dist_coeffs = np.zeros((4,1)) # Assuming no lens distortion
             
                if hasattr(self, 'rotation_vector'):
                    (success, self.rotation_vector, self.translation_vector) = cv2.solvePnP(self.model_points, 
                        image_points, self.camera_matrix, dist_coeffs, flags=cv2.SOLVEPNP_ITERATIVE, 
                        rvec=self.rotation_vector, tvec=self.translation_vector, 
                        useExtrinsicGuess=True)
                else:
                    (success, self.rotation_vector, self.translation_vector) = cv2.solvePnP(self.model_points, 
                        image_points, self.camera_matrix, dist_coeffs, flags=cv2.SOLVEPNP_ITERATIVE, 
                        useExtrinsicGuess=False)
             
                if not hasattr(self, 'first_angle'):
                    self.first_angle = np.copy(self.rotation_vector)
             
                bones = bpy.data.objects[self.rig_name].pose.bones
                x=self.smooth_value("h_x", 3, (self.rotation_vector[0] - self.first_angle[0])) 
                y=self.smooth_value("h_y", 3, (self.rotation_vector[1] - self.first_angle[1])) 
                z=self.smooth_value("h_z", 3, (self.rotation_vector[2] - self.first_angle[2])) 
                
                bones["head_fk"].rotation_euler[0] =  x
                bones["head_fk"].rotation_euler[1] =  -z
                bones["head_fk"].rotation_euler[2] =  y
                bones["head_fk"].keyframe_insert(data_path="rotation_euler", index=-1)
        
         
                for (x, y) in shape:
                    cv2.circle(image, (x, y), 2, (0, 255, 255), -1)
                 
            cv2.imshow("Output", image)
            cv2.waitKey(1)

        return {'PASS_THROUGH'}
    
    def init_camera(self):
        if self._cap == None:
            self._cap = cv2.VideoCapture(0)
            #self._cap = cv2.VideoCapture("/media/ubuntu/data/sign_videos/HabenSieSchmerzen0.mp4")
            
            self._cap.set(cv2.CAP_PROP_FRAME_WIDTH, self.width)
            self._cap.set(cv2.CAP_PROP_FRAME_HEIGHT, self.height)
            self._cap.set(cv2.CAP_PROP_BUFFERSIZE, 1)
            time.sleep(0.5)
    
    def stop_playback(self, scene):
        print(format(scene.frame_current) + " / " + format(scene.frame_end))
        if scene.frame_current == scene.frame_end:
            bpy.ops.screen.animation_cancel(restore_frame=False)
        
    def execute(self, context):
        bpy.app.handlers.frame_change_pre.append(self.stop_playback)

        wm = context.window_manager
        self._timer = wm.event_timer_add(0.02, window=context.window)
        wm.modal_handler_add(self)
        return {'RUNNING_MODAL'}

    def cancel(self, context):
        wm = context.window_manager
        wm.event_timer_remove(self._timer)
        cv2.destroyAllWindows()
        self._cap.release()
        self._cap = None

def register():
    bpy.utils.register_class(DlibAnimOperator)

def unregister():
    bpy.utils.unregister_class(DlibAnimOperator)

if __name__ == "__main__":
    register()

    # test call
    bpy.ops.wm.dlib_operator()
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174

恢复原始位置执行的代码

import bpy
import math
import mathutils

def Original():
    ob = bpy.data.objects['RIG-Vincent'] 
    head=ob.pose.bones['head_fk']  
    head.rotation_mode = 'XYZ'
    head.rotation_euler = mathutils.Euler((0.0, 0.0, 0.0), 'XYZ')
    head.rotation_mode = 'QUATERNION'
    head.keyframe_insert(data_path="rotation_euler" ,frame=-1)
       
Original()
1
2
3
4
5
6
7
8
9
10
11
12
13

声明：本文内容由网友自发贡献，不代表【wpsshop博客】立场，版权归原作者所有，本站不承担相应法律责任。如您发现有侵权的内容，请联系我们。转载请注明出处：https://www.wpsshop.cn/w/笔触狂放9/article/detail/332511