How is Command Data Structured for Use in Speech Recognition Models?