Text and Audio Alignment in TTS Datasets