Evaluating Diversity in TTS Datasets