Understanding Invoice Dataset for AI and OCR Model