1. Remove Duplicates:
2. Handle Missing Values:
3. Standardize Text Data:
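
The three steps listed above can be sketched in a few lines of standard-library Python. This is a minimal illustration, not a prescribed implementation: the record layout (a list of dicts with string or `None` values), the `MISSING` placeholder, and the function names `standardize_text` and `clean_records` are all assumptions made for the example.

```python
from typing import Optional

# Hypothetical placeholder for missing values; the source does not prescribe one.
MISSING = "unknown"


def standardize_text(value: Optional[str]) -> str:
    """Fill missing values, then trim, collapse whitespace, and lowercase."""
    if value is None or not value.strip():
        return MISSING
    return " ".join(value.split()).lower()


def clean_records(records: list[dict]) -> list[dict]:
    """Standardize every field, then drop records that are exact duplicates."""
    seen = set()
    cleaned = []
    for record in records:
        normalized = {key: standardize_text(val) for key, val in record.items()}
        fingerprint = tuple(sorted(normalized.items()))
        if fingerprint in seen:
            continue  # same content as an earlier record
        seen.add(fingerprint)
        cleaned.append(normalized)
    return cleaned


# Illustrative sample data (made up for this sketch).
rows = [
    {"name": "  Alice  Smith", "city": "Manila"},
    {"name": "alice smith", "city": "manila "},
    {"name": "Bob", "city": None},
]
cleaned = clean_records(rows)
```

In this sketch duplicates are removed after standardization rather than before, so records that differ only in case or spacing are treated as the same record. Whether that ordering is appropriate depends on the dataset.
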
| SOPs | Tools/Methods | Notes |
| --- | --- | --- |
| Handling of duplicates | Automated scripts | |
| Data normalization (e.g., consistent terminology, date formats) | Automated scripts | |
| **Removal of irrelevant information** | Manual review | Relevance is subjective; assessing it requires a domain expert. |
| Spelling and grammar checks | Automated scripts | |
| Removal of sensitive data (if applicable) | Automated scripts, manual review | “Sensitive data” shall be defined according to the Data Privacy Act of 2012. |
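
As an illustration of what an automated script for the data normalization SOP might look like, the sketch below maps variant terms to one canonical form and rewrites dates as ISO 8601 strings. The `TERM_MAP` entries, the accepted `DATE_FORMATS` (including the day-first reading of slash-separated dates), and the function names are assumptions for the example, not part of the SOP.

```python
from datetime import datetime
from typing import Optional

# Hypothetical synonym table; a real one would come from the project's glossary.
TERM_MAP = {
    "ph": "Philippines",
    "phils": "Philippines",
    "philippines": "Philippines",
}

# Assumed input formats, tried in order. Day-first is an assumption.
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y", "%Y/%m/%d")


def normalize_term(term: str) -> str:
    """Return the canonical term if one is known, else the trimmed input."""
    key = term.strip().lower().rstrip(".")
    return TERM_MAP.get(key, term.strip())


def normalize_date(text: str) -> Optional[str]:
    """Return the date as YYYY-MM-DD, or None if no known format matches."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).date().isoformat()
        except ValueError:
            continue
    return None  # unparseable values can be routed to manual review
```

Returning `None` for unrecognized dates, rather than guessing, keeps ambiguous values visible so they can be checked by hand.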