EMPOP Database
In total, the EMPOP Release 13 holds 48,572 quality-controlled mitotypes with at least HVS-I variation (16024-16365). Thereof,
- 46,963 cover HVS-I and HVS-II (16024-16365 73-340)
- 38,361 cover the Control Region (16024-576)
- 4,289 cover the entire mitogenome (ALL)
EMPOP holds high quality population data
The EMPOP database aims at the collection, quality control and searchable presentation of mtDNA haplotypes from diverse world populations.
The scientific concept and the quality control measures using logical and phylogenetic tools were found suitable for forensic purposes, by e.g.
- by declaration of the German Supreme Court of Justice (2010)
- the SWGDAM mtDNA interpretation guidelines (2013)
- and the updated ISFG guidelines for mtDNA analysis (2014)
Alignment-free query engine for unbiased search results and pyhlogenetic alignment of mtDNA sequences
EMPOP uses SAM2, a string-based search algorithm that converts query and database sequences into alignment-free nucleotide strings and thus guarantees that a haplotype is found in a database query regardless of its alignment.
Furthermore, SAM2 provides the phylogenetic alignment of mtDNA sequences to harmonize alignment and notation of mtDNA sequences.
EMPOP generates conservative haplogroup estimates
EMPOP provides automated haplogroup estimates. These are based on maximum likelihood and minimal costs functions. For multiple possible haplogroups most recent common ancestor (MRCA) haplogroups are provided.
The geographical haplogroup patterns are provided via maps to visualize and better understand their distribution.