Quality vs Cost in Speech Datasets: Key Trade-offs