Merge pull request #3 from Tobias-Fischer/facial_landmarks

Improved Facial Landmarks detection using face-alignment package
Tobias-Fischer · Nov 6, 2018 · 8a06793 · 8a06793
2 parents f17737c + d7555f5
commit 8a06793
Show file tree

Hide file tree

Showing 4 changed files with 55 additions and 232 deletions.
diff --git a/rt_gene/README.md b/rt_gene/README.md
@@ -20,7 +20,7 @@ More information can be found on the Personal Robotic Lab's website: <https://ww
 ## Manual installation
 1) Download, install, and configure ROS (full installation; we recommend the Kinectic distribution of ROS): http://wiki.ros.org/kinetic/Installation
 1) `sudo apt-get install python-catkin-tools ros-$ROS_DISTRO-ros-numpy ros-$ROS_DISTRO-camera-info-manager-py ros-$ROS_DISTRO-uvc-camera`
-1) `pip install tensorflow-gpu keras numpy scipy tqdm torch torchvision Pillow`
+1) `pip install tensorflow-gpu keras numpy scipy tqdm torch torchvision Pillow face-alignment`
 1) `cd $HOME/catkin_ws/src && git clone https://github.com/Tobias-Fischer/rt_gene.git`
 1) `cd $HOME/catkin_ws && catkin build`
 
@@ -32,12 +32,6 @@ More information can be found on the Personal Robotic Lab's website: <https://ww
     - `wget https://imperialcollegelondon.box.com/shared/static/hmcoxopu4xetic5bm47xqrl5mqktpg92.h5 -O $(rospack find rt_gene)/model_nets/all_subjects_mpii_prl_utmv_3_02`
 - Finally, open `$(rospack find rt_gene)/launch/estimate_gaze.launch` and comment out `<rosparam param="model_files">['model_nets/Model_allsubjects1.h5']</rosparam>` and uncomment `<!--rosparam param="model_files">['model_nets/all_subjects_mpii_prl_utmv_0_02.h5', ..., ..., ...</rosparam-->`
 
-## Improved head pose estimation
-This is currently disabled by default as the DNN module in the OpenCV shipped by ROS is broken (see https://github.com/ros-gbp/opencv3-release/issues/20). To enable, build OpenCV3 from sources and link the ROS OpenCV to the one compiled from sources. Then download the following files:
-- `wget https://github.com/yinguobing/head-pose-estimation/raw/master/assets/res10_300x300_ssd_iter_140000.caffemodel -P $(rospack find rt_gene)/model_nets`
-- `wget https://raw.githubusercontent.com/yinguobing/head-pose-estimation/master/assets/deploy.prototxt -P $(rospack find rt_gene)/model_nets`
-
-Finally, set `use_mtcnn` to `False` in `$(rospack find rt_gene)/launch/estimate_gaze.launch`
 
 ## Requirements for live gaze estimation (Kinect One)
 - Follow instructions for https://github.com/code-iai/iai_kinect2

diff --git a/rt_gene/launch/estimate_gaze.launch b/rt_gene/launch/estimate_gaze.launch
@@ -1,6 +1,4 @@
 <launch>
-    <arg name="use_mtcnn" default="True" />
-
     <arg name="interpupillary_distance" default="0.058"/>
     <arg name="start_rviz" default="False"/>
 
@@ -24,7 +22,6 @@
             args="0.0 0.0 0.0 0.0 0.0 0.0 1.0 $(arg rgb_frame_id_ros) $(arg rgb_frame_id) 100" unless="$(arg is_frame_rotated)" />
 
       <node pkg="rt_gene" type="extract_landmarks_node.py" name="extract_landmarks_new" output="screen">
-          <param name="use_mtcnn" value="$(arg use_mtcnn" />
           <param name="rgb_frame_id" value="$(arg rgb_frame_id)" />
           <param name="rgb_frame_id_ros" value="$(arg rgb_frame_id_ros)" />
           <param name="tf_prefix" value="$(arg tf_prefix)" />

diff --git a/rt_gene/scripts/estimate_gaze.py b/rt_gene/scripts/estimate_gaze.py
@@ -101,7 +101,9 @@ def estimate_gaze_twoeyes(self, test_input_left, test_input_right, headpose):
                 predictions.append(model.predict({'img_input_L': test_input_left,
                                                   'img_input_R': test_input_right,
                                                   'headpose_input': test_headpose})[0])
-            return np.mean(np.array(predictions), axis=0)
+            mean_prediction = np.mean(np.array(predictions), axis=0)
+            # mean_prediction[1] += 0.11 # when using the ensemble model - they're all off then!
+            return mean_prediction
 
     def visualize_eye_result(self, eye_image, est_gaze):
         """Here, we take the original eye eye_image and overlay the estimated gaze."""