In total, the EMPOP Release 12 holds 42,839 quality-controlled mitotypes with at least HVS-I variation (16024-16365). Thereof,
- 41,385 cover HVS-I and HVS-II (16024-16365 73-340)
- 33,447 cover the Control Region (16024-576)
- 1,366 cover the entire mitogenome (ALL)
EMPOP holds high quality population data
The EMPOP database aims at the collection, quality control and searchable presentation of mtDNA haplotypes from diverse world populations.
The scientific concept and the quality control measures using logical and phylogenetic tools were found suitable for forensic purposes, by e.g.
- by declaration of the German Supreme Court of Justice (2010)
- the SWGDAM mtDNA interpretation guidelines (2013)
- and the updated ISFG guidelines for mtDNA analysis (2014)
Alignment-free query engine for unbiased search results and pyhlogenetic alignment of mtDNA sequences
EMPOP uses SAM2, a string-based search algorithm that converts query and database sequences into alignment-free nucleotide strings and thus guarantees that a haplotype is found in a database query regardless of its alignment.
Furthermore, SAM2 provides the phylogenetic alignment of mtDNA sequences to harmonize alignment and notation of mtDNA sequences.
EMPOP generates conservative haplogroup estimates
EMPOP provides automated haplogroup estimates. These are based on maximum likelihood and minimal costs functions. For multiple possible haplogroups most recent common ancestor (MRCA) haplogroups are provided.
The geographical haplogroup patterns are provided via maps to visualize and better understand their distribution.