Benchmark Dataset in Speech AI Explained