Mate pair whole genome sequencing of 98 AML samples
This dataset contains BAM files from mate-pair whole genome sequencing of 98 AML samples. DNA was extracted from bone marrow or peripheral blood of primary AML samples. Libraries were prepared with the Illumina Nextera mate pair library preparation kit, generating long-insert (2-8 kb) paired-end libraries, and sequenced on an Illumina NextSeq 500 using 2x76 bp paired-end chemistry. The resulting FASTQ files were aligned to the human hg19 reference genome (ucsc.hg19.fasta from the GATK resource bundle) using bwa (0.7.15-r1140), and duplicate reads were identified using samblaster (0.1.24).
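As a rough illustration of the alignment and duplicate-marking steps described above, the following Python sketch assembles a command pipeline of the same general shape. It is not the study's actual pipeline: the `bwa mem` subcommand, the thread count, the final `samtools view` conversion to BAM, and all sample file names are assumptions made for illustration, and any mate-pair-specific preprocessing (such as junction adapter trimming) is not shown because the description does not specify it.

```python
import shlex


def build_pipeline(reference, fastq_r1, fastq_r2, out_bam, threads=4):
    """Return the stages of a hypothetical align-and-mark-duplicates pipeline.

    Each stage is an argv list; the stages are meant to be connected by pipes.
    """
    # Assumption: paired-end alignment with the "mem" algorithm of bwa.
    align = ["bwa", "mem", "-t", str(threads), reference, fastq_r1, fastq_r2]
    # samblaster reads SAM on stdin and writes SAM with duplicates flagged to stdout.
    mark_duplicates = ["samblaster"]
    # Assumption: samtools is used to convert the SAM stream to a BAM file.
    to_bam = ["samtools", "view", "-b", "-o", out_bam, "-"]
    return [align, mark_duplicates, to_bam]


def as_shell(stages):
    """Render the stages as a single shell pipeline string."""
    return " | ".join(shlex.join(stage) for stage in stages)


# Hypothetical file names; only the reference name comes from the description.
stages = build_pipeline(
    "ucsc.hg19.fasta", "sample_R1.fastq.gz", "sample_R2.fastq.gz", "sample.bam"
)
print(as_shell(stages))
```

The sketch only builds and prints the command line, so it can be inspected without bwa, samblaster, or samtools installed. Reads flagged by samblaster carry the standard SAM duplicate flag (0x400) in the resulting BAM files.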
This dataset is one of four included in the study titled "The cellular state space of AML unveils novel NPM1 subtypes with distinct clinical outcomes and immune evasion properties", http://identifiers.org/ega.study:EGAS50000001084.
Official landing page: http://identifiers.org/ega.dataset:EGAD50000001574