Skip to main content

Quality Standards

rdm |

You will find bellow some general standards of data quality, from the UK Data Archive:

Quality control of data is an integral part of all research and takes place at various stages: during data collection, data entry or digitisation, and data checking. It is important to assign clear roles and responsibilities for data quality assurance at all stages of research and to develop suitable procedures before data gathering starts

  1. Quality control measures during data collection may include:

• calibration of instruments to check the precision, bias and/or scale of measurement

• taking multiple measurements, observations or samples

• checking the truth of the record with an expert

• using standardised methods and protocols for capturing observations, alongside recording forms with clear instructions

• computer-assisted interview software to: standardize interviews, verify response consistency, route and customise questions so that only appropriate questions are asked, confirm responses against previous answers where appropriate and detect inadmissible responses

      2. Quality control during digitalisation, entry or coding of data

The quality of data collection methods used strongly influences data quality and documenting in detail how data are collected provides evidence of such quality. When data are digitised, entered in a database or  spreadsheet, or coded, quality is ensured and error avoided by using standardised and consistent procedures with clear instructions. These may include:

• setting up validation rules or input masks in data entry software

• using data entry screens

• using controlled vocabularies, code lists and choice lists to minimise manual data entry

• detailed labelling of variable and record names to avoid confusion

• designing a purpose-built database structure to organise data and data files

             3. Qualitative data transcription quality

Transcription is a translation between forms of data, most commonly to convert audio recordings to text in

qualitative research. Whilst transcription is often part of the analysis process, it also enhances the sharing and reuse

potential of qualitative research data. Full transcription is recommended for data sharing.

If transcription is outsourced to an external transcriberI, attention should be paid to:

• data security when transmitting recordings and transcripts between researcher and transcriber

• data security procedures for the transcriber to follow

• a non-disclosure agreement for the transcriber

• transcriber instructions or guidelines, indicating required transcription style, layout and editing

Best practice is to:

• consider the compatibility of transcription formats with import features of qualitative data analysis software, e.g. loss of headers and formatting, before developing a template or guidelines

• develop a transcription template to use, especially if multiple transcribers carry out work

• ensure consistency between transcripts

• anonymise data during transcription, or mark sensitive information for later anonymization

Transcripts should:

• have a unique identifier that labels an interview either through a name or number

• have a uniform layout throughout a research project or data collection

• use speaker tags to indicate turn-taking or question/answer sequence in conversations

• carry line breaks between turn-takes

• be page numbered

• have a document cover sheet or header with brief interview or event details such as date, place, interviewer name, interviewee detail

            4. Data checking

During data checking, data are edited, cleaned, verified, cross-checked and validated.

Checking typically involves both automated and manual procedures. These may include:

• double-checking coding of observations or responses and out-of-range values

• checking data completeness

• verifying random samples of the digital data against the original data

• double entry of data

• statistical analyses such as frequencies, means, ranges or clustering to detect errors and anomalous values

• peer review

 

Finally, note that data quality best practices highly depend on the type of data collected and the research field. Therefore, contact your research institute to see whether some best practices are already available.