AI-driven digital assistants (DAs) operate in safety-critical environments (airborne, air traffic control, and airports), where dataset quality directly determines model performance, robustness, and trustworthiness. Dataset creation must therefore be treated as a structured lifecycle, from acquisition through long-term maintenance. The whitepaper details these findings and presents examples and solutions drawn from the development of the JARVIS DAs.