1. Vector: Embedding, Latent Representation, Latent Code
2. Binary Classifier 评估 Encoder
3. Feature Disentangle 特征拆解
3.1 声音变声
3.2 IN & AdaIN
IN = Instance Normalization (remove global information)
AdaIN = Adaptive Instance Normalization (only influence global information)
4. Discrete Representation
Binary vector (参数较少,还可以识别没有见到的样本)
参考文献
Machine Learning (2019,Spring)
Voice Conversion