Taking data protection into account in data collection and management

07 June 2024

The development of an artificial intelligence system requires rigorous management and monitoring of training data. The CNIL details how data protection principles relate to training data management.

Once the data and their sources have been identified, the AI system provider must carry out the collection and build its dataset. To this end, the principles of privacy by design must be incorporated from the outset.

Data creation: data collection; pre-processing: cleaning, annotation, feature extraction, data allocation

Collection


Data cleaning, data identification and privacy by design


Monitoring and updating


Data storage


Security


Documentation