Top Banner
PIFA Approach Introduction 3D Face Modeling Prior Work Experimental Results 3D Surface-Enable Visibility Cascaded Couple-Regressor , K K M p N K-th Projection Matrix Regressor K-th 3D Shape Parameter Regressor 1 2 1 1 1 1 1 arg min , , ; d k N k k k k k i i i i i M R IU v 1 1 0 1 S N k k i i i i i U M S p S 2 2 2 2 2 1 arg min , , ; d k N k k k k k i i i i i p R IU v 1 0 1 S N k k i i i i i U M S p S 1 k k k i i i M M M 1 f 2 f 2 f 3 f 3 f 3 f N 1 2 1 2 . T m m v N m m Number of images N Metric PIFA CDM RCPR TCDCN 468 6 MAPE 8.61 9.13 - - 313 5 NME 9.42 - 9.30 8.20 a 2×4 matrix with 7 degrees of freedom (pitch, yaw, roll, 2 scales and 2 translation). 1 2 1 2 ... ... N N u u u U MS v v v 1 2 1 2 0 1 1 2 ... ... ... 1 1 ... 1 S N N N i i i N x x x y y y S S pS z z z x y t M sR t 1 2 , ,..., S N p p p p 3D scans w. labels 2D image w. landmarks Method 3D landmark Visibility Pose-related database Pose range Landmark # Estimation Error RCPR (ICCV2013) NO Yes COFW frontal w. occlu 19 8.5 CoR (ECCV2014) NO Yes COFW; LFPW-O; Helen-O frontal w. occlu 19; 49; 49 8.5 TSPM (CVPR2012) NO NO AFW all poses 6 11.1 CDM (ICCV2013) NO NO AFW all poses 6 9.1 OSRD (CVPR2014) NO NO MVFW < ±40° 68 N/A TCDCN (ECCV2014) NO NO AFLW, AFW < ±60° 5 8.0, 8.2 PIFA Yes Yes AFLW, AFW all poses 21, 6 6.5, 8.6 We proposed a method which estimates 2D landmarks and their visibility for a face with arbitrary pose. estimates both the projection matrix and 3D landmarks. achieves superior performances than state of the art methods. Visible Landmarks Invisible Landmarks 3D scans w. labels 2D images w. labels 0 S 1 S 2 S 3D shape p Projection M 0 0 , , IM p Training Testing 3 f We rotate the 3D normal surface vectors according to the rotation angle indicated by projection matrix. The sign of z coordinate indicates 2D landmark visibility. 1 m 2 m Number of images Metric PIFA CDM RCPR 1299 NME 6.52 - 7.15 783 NME 6.08 8.65 - AFLW dataset experiments AFW dataset experiments Initialization Estimated Landmarks Visible Landmarks Invisible Landmarks 3 out of 9 zones with least occlusion are selected. For each selected zone, a depth 5 random fern regressor is learned. The final regressor is a weighted mean voting of 3 fern regressors. 1 k k k i i i p p p Any regressor might be used for R. We used Linear regressor Fern regressor M . . . . . . . . The average NME of each landmark The NME of five pose groups 3D estimation 1 1 R Projection Regressor 1 2 R Shape Regressor Shape Regressor 2 R K 1 R K Projection Regressor 468 faces in 205 images with poses ±90. Labeled with up to 6 visible landmarks. BP4D-S database experiments Includes pairs of 2D images and 3D scans of 41 subjects. Half of selected 1100 images for training and rest for testing. The mean 3D shape is used as a baseline (after global transformation). The MAPE of baseline is 5.02, while PIFA is 4.75. 5200 images selected evenly within yaw angles. Randomly partitioned into 3901 training and 1299 testing images. 0 ,30 , 30 ,60 , 60 ,90 is a function to extract 32 N-dim HOG feature vector. , f IU We represent 2D landmarks U as pair of projection matrix M and 3D shape parameter p.
1

Introduction PIFA Approach 3D Surface-Enable Visibility

Dec 07, 2021

Download

Documents

dariahiddleston
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Introduction PIFA Approach 3D Surface-Enable Visibility

PIFA Approach Introduction

3D Face Modeling

Prior Work Experimental Results

3D Surface-Enable Visibility

Cascaded Couple-Regressor

,K KM p

N

K-th Projection Matrix Regressor K-th 3D Shape Parameter Regressor

1

21

1 1 1

1

arg min , , ;d

k

Nk k k k k

i i i i

i

M R I U v

1 1

0

1

SNk k

i i i i

i

U M S p S

2

2

2 2 2

1

arg min , , ;d

k

Nk k k k k

i i i i

i

p R I U v

1

0

1

SNk k

i i i i

i

U M S p S

1k k k

i i iM M M

1f

2f 2f

3f 3f 3f

N

1 2

1 2

.T m m

v Nm m

Number of images N Metric PIFA CDM RCPR TCDCN

468 6 MAPE 8.61 9.13 - -

313 5 NME 9.42 - 9.30 8.20

a 2×4 matrix with 7 degrees of freedom (pitch, yaw, roll, 2 scales and 2 translation).

1 2

1 2

...

...

N

N

u u uU MS

v v v

1 2

1 2

0

11 2

...

...

...

1 1 ... 1

S

N

NN

i i

iN

x x x

y y yS S p S

z z z

x

y

tM sR

t

1 2, ,...,SNp p p p

3D scans

w. labels

2D image

w. landmarks

Method 3D

landmark Visibility

Pose-related

database Pose range Landmark #

Estimation

Error

RCPR

(ICCV2013) NO Yes COFW

frontal w.

occlu 19 8.5

CoR

(ECCV2014) NO Yes

COFW;

LFPW-O;

Helen-O

frontal w.

occlu 19; 49; 49 8.5

TSPM

(CVPR2012) NO NO AFW all poses 6 11.1

CDM

(ICCV2013) NO NO AFW all poses 6 9.1

OSRD

(CVPR2014) NO NO MVFW < ±40° 68 N/A

TCDCN

(ECCV2014) NO NO AFLW, AFW < ±60° 5 8.0, 8.2

PIFA Yes Yes AFLW, AFW all poses 21, 6 6.5, 8.6

We proposed a method which

• estimates 2D landmarks and their visibility for a face with arbitrary pose.

• estimates both the projection matrix and 3D landmarks.

• achieves superior performances than state of the art methods.

Visible Landmarks

Invisible Landmarks

3D scans

w. labels

2D images

w. labels 0S

1S 2S

3D shape p

Projection M

0 0, ,I M p

Training

Testing

3f

We rotate the 3D normal surface vectors according to the rotation angle indicated by projection matrix.

The sign of z coordinate indicates 2D landmark visibility.

1m

2m

Number of images Metric PIFA CDM RCPR

1299 NME 6.52 - 7.15

783 NME 6.08 8.65 -

AFLW dataset experiments

AFW dataset experiments

Initialization

Estimated

Landmarks

Visible Landmarks

Invisible Landmarks

• 3 out of 9 zones with least occlusion are selected.

• For each selected zone, a depth 5 random fern regressor is learned.

• The final regressor is a weighted mean voting of 3 fern regressors.

1k k k

i i ip p p

Any regressor might be used for R. We used

Linear regressor

Fern regressor

M

. . . .

. . . .

The average NME of each landmark

The NME of five pose groups

3D

estimation

1

1R

Projection

Regressor

1

2R

Shape

Regressor

Shape

Regressor

2R K

1R K

Projection

Regressor

• 468 faces in 205 images with poses ±90.

• Labeled with up to 6 visible landmarks.

BP4D-S database experiments

• Includes pairs of 2D images and 3D scans of 41 subjects.

• Half of selected 1100 images for training and rest for testing.

• The mean 3D shape is used as a baseline (after global transformation).

• The MAPE of baseline is 5.02, while PIFA is 4.75.

• 5200 images selected evenly within yaw angles.

• Randomly partitioned into 3901 training and 1299 testing images.

0 ,30 , 30 ,60 , 60 ,90

is a function to extract 32 N-dim HOG feature vector.

,f I U

We represent 2D landmarks U as pair of projection matrix M and 3D shape parameter p.