The raw output of sequencing consists of ‘reads’ (strings of DNA sequence), which are analysed through a series of basic bioinformatic steps that, when combined, are known as a ‘pipeline’. These pipelines remove sequencing errors and dubious reads in order to ‘clean’ the data, which can then be aligned against reference genomic databases to identify and profile the samples accurately.
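To make the ‘cleaning’ step concrete, the following is a minimal sketch in Python of how dubious reads might be filtered before alignment. It assumes reads are stored in four-line FASTQ records with Phred+33 quality encoding. The function names (`parse_fastq`, `mean_phred`, `clean_reads`) and the thresholds are hypothetical choices for illustration, not taken from any particular pipeline. Real pipelines generally rely on dedicated, well-tested tools for this step, and the alignment against a reference database is not shown.

```python
from io import StringIO
from typing import Iterable, Iterator, NamedTuple, TextIO


class Read(NamedTuple):
    name: str
    sequence: str
    quality: str  # one quality character per base (assumed Phred+33)


def parse_fastq(handle: TextIO) -> Iterator[Read]:
    """Yield reads from a FASTQ stream with exactly four lines per record."""
    while True:
        header = handle.readline().strip()
        if not header:  # end of input (or a blank line)
            return
        sequence = handle.readline().strip()
        handle.readline()  # skip the '+' separator line
        quality = handle.readline().strip()
        yield Read(header[1:], sequence, quality)  # drop the leading '@'


def mean_phred(quality: str, offset: int = 33) -> float:
    """Mean Phred score of a quality string (offset 33 is an assumption)."""
    if not quality:
        return 0.0
    return sum(ord(char) - offset for char in quality) / len(quality)


def clean_reads(
    reads: Iterable[Read],
    min_mean_quality: float = 20.0,  # illustrative threshold
    min_length: int = 4,             # illustrative threshold
) -> Iterator[Read]:
    """Keep reads that are long enough, unambiguous, and of adequate quality."""
    for read in reads:
        if len(read.sequence) < min_length:
            continue  # too short to place reliably
        if "N" in read.sequence.upper():
            continue  # contains an uncalled (ambiguous) base
        if mean_phred(read.quality) < min_mean_quality:
            continue  # low average base-call confidence
        yield read


# Three made-up reads: one clean, one with an ambiguous base, one low quality.
EXAMPLE_FASTQ = (
    "@read1\nACGTACGT\n+\nIIIIIIII\n"
    "@read2\nACGTNCGT\n+\nIIIIIIII\n"
    "@read3\nACGTACGT\n+\n!!!!!!!!\n"
)

kept = [read.name for read in clean_reads(parse_fastq(StringIO(EXAMPLE_FASTQ)))]
print(kept)  # ['read1']
```

In this toy input only `read1` survives: `read2` contains an uncalled base and `read3` has a mean quality of zero. The surviving reads are what a pipeline would pass on to the alignment stage.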