How is a doctor–patient speech dataset built?