Key Metrics for Evaluating In-Car Keyword Spotting Models