Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGAN

Last update: Dec 01, 2022

Overview

Interpretable Control Exploration and Counterfactual Explanation (ICE) on StyleGAN

Bo Li, Qiulin Wang, Jiquan Pei, Yu Yang, Xiangyang Ji

Abstract: The semantically disentangled latent subspace in GAN provides rich interpretable controls in image generation. This paper includes two contributions on semantic latent subspace analysis in the scenario of face generation using StyleGAN2. First, we propose a novel approach to disentangle latent subspace semantics by exploiting existing face analysis models, e.g., face parsers and face landmark detectors. These models provide the flexibility to construct various criteria with very concrete and interpretable semantic meanings (e.g., change face shape or change skin color) to restrict latent subspace disentanglement. Rich, previously unknown latent space controls can be discovered using the constructed criteria. Second, we propose a new perspective to explain the behavior of a CNN classifier by generating counterfactuals in the interpretable latent subspaces we discovered. This explanation helps reveal whether the classifier learns semantics as intended. Experiments on various disentanglement criteria demonstrate the effectiveness of our approach. We believe this approach contributes to both areas of image manipulation and counterfactual explainability of CNNs.

The code builds on NVlabs/stylegan2-ada-pytorch and lives in the ice folder. Start with the two IPython notebooks described below.

ice/discover_subspaces

Solves for latent subspaces using face analysis models as criteria. Only several representative subspaces are currently included. The notebook requires downloading some pre-trained models, and it may take some effort to put every file in the right place; see the notebook comments for details. The notebook sketches the code used to generate Figure 3 in the paper, i.e., the latent subspaces for interpretable face manipulation.
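To make the idea of "criteria that restrict disentanglement" concrete, here is a minimal sketch that is not taken from the notebook. It assumes a toy linear function in place of StyleGAN2 plus a face analysis model, and it uses a simple first-order technique (projecting the target criterion's gradient away from the gradients of the restricted criteria) rather than the paper's actual solver. All names (toy_generator, target_criterion, restricted_criterion, discover_direction) are hypothetical, and plain Python lists stand in for tensors so the example runs without dependencies.

```python
import math


def toy_generator(w):
    """Hypothetical stand-in for generator + face analysis model.

    Maps a 4-D latent code to two measurable attributes. The real
    pipeline would render an image and run a parser or landmark model.
    """
    face_width = 2.0 * w[0] + 0.5 * w[1] + 0.1 * w[2]
    skin_tone = 1.0 * w[1] + 0.3 * w[3]
    return face_width, skin_tone


def target_criterion(w):
    # Attribute we want the discovered direction to change.
    return toy_generator(w)[0]


def restricted_criterion(w):
    # Attribute we want the discovered direction to leave unchanged.
    return toy_generator(w)[1]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def numerical_gradient(f, w, eps=1e-5):
    """Central finite-difference gradient of a scalar criterion."""
    grad = []
    for i in range(len(w)):
        plus, minus = list(w), list(w)
        plus[i] += eps
        minus[i] -= eps
        grad.append((f(plus) - f(minus)) / (2.0 * eps))
    return grad


def discover_direction(w, target, restricted):
    """Unit direction that changes `target` but, to first order,
    leaves every criterion in `restricted` unchanged."""
    # Orthonormal basis of the restricted gradients (Gram-Schmidt).
    basis = []
    for criterion in restricted:
        v = numerical_gradient(criterion, w)
        for b in basis:
            c = dot(v, b)
            v = [x - c * y for x, y in zip(v, b)]
        norm = math.sqrt(dot(v, v))
        if norm > 1e-12:
            basis.append([x / norm for x in v])
    # Remove the restricted components from the target gradient.
    g = numerical_gradient(target, w)
    for b in basis:
        c = dot(g, b)
        g = [x - c * y for x, y in zip(g, b)]
    norm = math.sqrt(dot(g, g))
    if norm < 1e-12:
        raise ValueError("target cannot change without changing a restricted criterion")
    return [x / norm for x in g]


w0 = [0.0, 0.0, 0.0, 0.0]
direction = discover_direction(w0, target_criterion, [restricted_criterion])
w1 = [a + 0.5 * d for a, d in zip(w0, direction)]
print("face width:", target_criterion(w0), "->", target_criterion(w1))
print("skin tone: ", restricted_criterion(w0), "->", restricted_criterion(w1))
```

Because the toy attributes are linear, moving along the discovered direction changes the face width while the skin tone stays fixed. With a real generator the guarantee holds only locally, which is one reason a practical solver would be more involved than this sketch.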

ice/explain_counterfactually

Uses the interpretable subspaces discovered by the notebook above to explain an attractiveness classifier. The notebook sketches the code used to generate Figure 4 in the paper, i.e., the interpretable counterfactuals that increase the attractiveness score of a given classifier. Since we did not find a good public pre-trained model, we trained the attractiveness classifier ourselves using d-li14/face-attribute-prediction.
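The counterfactual idea can be illustrated with a small sketch that is not the notebook's code. It assumes a toy logistic scorer in place of the trained attractiveness classifier and three hand-written unit directions in place of discovered subspaces; the names toy_classifier, SUBSPACES, apply_edit and counterfactual are hypothetical. The sketch runs gradient ascent on the score over one coefficient per named direction, so the resulting coefficients indicate which interpretable control the classifier responds to.

```python
import math


def toy_classifier(w):
    """Hypothetical stand-in for the attractiveness classifier.

    Returns a score in (0, 1) for a 3-D latent code. It ignores w[2].
    """
    logit = 1.5 * w[0] - 0.2 * w[1] + 0.0 * w[2] - 1.0
    return 1.0 / (1.0 + math.exp(-logit))


# Hypothetical interpretable directions, one per named control.
SUBSPACES = {
    "face_shape": [1.0, 0.0, 0.0],
    "skin_tone": [0.0, 1.0, 0.0],
    "hair": [0.0, 0.0, 1.0],
}


def apply_edit(w, alphas):
    """Move the latent code along each named direction by its coefficient."""
    out = list(w)
    for name, alpha in alphas.items():
        out = [x + alpha * d for x, d in zip(out, SUBSPACES[name])]
    return out


def counterfactual(w, steps=200, lr=0.5, eps=1e-5):
    """Gradient ascent on the classifier score, restricted to the
    span of the interpretable directions."""
    alphas = {name: 0.0 for name in SUBSPACES}
    for _ in range(steps):
        grads = {}
        for name in alphas:
            up, down = dict(alphas), dict(alphas)
            up[name] += eps
            down[name] -= eps
            diff = toy_classifier(apply_edit(w, up)) - toy_classifier(apply_edit(w, down))
            grads[name] = diff / (2.0 * eps)
        for name in alphas:
            alphas[name] += lr * grads[name]
    return alphas


w0 = [0.0, 0.0, 0.0]
alphas = counterfactual(w0)
w_cf = apply_edit(w0, alphas)
most_influential = max(alphas, key=lambda name: abs(alphas[name]))
print("score:", toy_classifier(w0), "->", toy_classifier(w_cf))
print("coefficients:", alphas)
print("most influential control:", most_influential)
```

Reading the coefficients is the explanation step: a large coefficient means the classifier's score is sensitive to that control, and a coefficient near zero means the classifier ignores it. In the real setting the edited latent code would also be rendered to an image so the change can be inspected visually.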
