In this folder, we store code for co-supervising audio pitch detection network from visual height detection network.