DNAChron: A New High-Precision Y-chromosome Haplogroup Tree, Free To Upload Now!

DNAChron

Introduction

DNAChron is a new site where you can upload Y-chromosome sequencing data and have it placed on a Y-chromosome haplogroup tree. It is not just another Y tree. We were not satisfied with the age estimation methods of current Y trees, which discard almost half of all SNPs and all INDELs and lose accuracy as a result. So we developed a new algorithm that gets as much precision as possible out of mutation analysis and age estimation, and DNAChron was born.
We hope this new algorithm helps uncover more unknown history. Chronicle your unique story in the past.
Overview

  • High-precision mutation analysis. On average, we find 30%~100% more mutations per haplogroup than other Y trees. More private mutations can be found for you from existing data. In rapidly expanding haplogroups, we find more subclades that can be grouped under a new intermediate branch, which clarifies the relationships between haplogroups. Newly discovered mutations are named with the prefix C (Chronicle).
  • Analysis covers the whole Y-chromosome sequence, using 18M bases in total. Free remapping of bam/cram/fastq data to the Hg38 reference, while retaining the mapping to the T2T reference (CP086569.2). Unlock the potential of your whole genome sequencing data.
  • High-precision age estimation. We provide a probability distribution diagram over years. The precision of the effective age estimate improves by a factor of 3~5.
  • Realistic age estimation. For continuously expanding haplogroups such as R-L151, there is no age contradiction between upstream and downstream haplogroups, and the age difference between haplogroups does not deviate from the number of mutations separating them. This removes the systematic bias of the traditional algorithm, where the more subclades a haplogroup has, the younger its estimated age.
  • Matching between different versions of the ISOGG tree and YTree. This helps you quickly identify the main haplogroup, and map the historical ISOGG IDs found in many papers and online sources to the current haplogroup.
  • For samples with low sequencing coverage, we also show the tree uncertainty: besides the current tree position, which subclades the sample might merge with, or whether it might even belong downstream of a sibling subclade.
  • After registering, you can browse the analysis results and raw data of more than 7000 public samples.
  • Rapid, automated, and standardized analysis process. You get results within 3 working days of importing data.
  • Rich functionality: mutation, haplogroup, and ISOGG search; browsing raw data, reference sequence, and mutation info; publishing personal information; etc.
  • Datacenter in Singapore.
  • The DNAChron Y-tree is under construction, FREE to upload now!
DNAChron VS Others

Feature | DNAChron | Others
Mutation analysis precision | On average, 60 years per mutation, which makes it easier to resolve rapidly expanding haplogroups. | On average, 80~100 years per mutation.
Confidence interval | Yes, with a probability distribution diagram. | A 95% CI range, or nothing.
True confidence interval | Yes | The age is estimated from a fixed range of about 8M bases that does not change with the sample, although commercial sequencing can now cover more than 14M.
Age estimation precision | Up to 60 years per mutation. | Fixed at more than 100 years per mutation.
Realistic age estimation | Yes | The relationship between upstream and downstream is not considered. There may be age contradictions between upstream and downstream, and the age difference between haplogroups may deviate seriously from the number of mutations between them.
Accuracy of age estimation increases with the number of samples | Yes | Once the number of samples and subclades grows past a certain point, accuracy cannot improve further because of the contradiction between upstream and downstream age estimates.
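
The "years per mutation" figures compared above rest on simple arithmetic: if mutations accumulate at a roughly constant rate per base per year, the average wait between mutations on one lineage is the inverse of (rate x number of bases analysed). Here is a minimal sketch, assuming a purely illustrative rate of 8e-10 mutations per base per year. That rate and the function name are assumptions for this example, not DNAChron's values.

```python
def years_per_mutation(covered_bases, rate_per_base_year):
    """Average number of years between mutations on a single lineage."""
    return 1.0 / (covered_bases * rate_per_base_year)


# Illustrative rate only (an assumption, not a figure published by DNAChron).
ASSUMED_RATE = 8e-10

for label, bases in [("8M", 8_000_000), ("14M", 14_000_000), ("18M", 18_000_000)]:
    years = years_per_mutation(bases, ASSUMED_RATE)
    print(f"{label} bases: about {years:.0f} years per mutation")
```

The absolute numbers depend entirely on the assumed rate, and counting INDELs and MNPs as well as SNPs would raise the effective rate. The only point here is the scaling: the more bases are analysed, the shorter the interval each mutation represents.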



Snapshots (Can't paste image now)

New findings in widely known haplogroups:

Age estimates:

Age estimates with no contradiction between upstream and downstream:

Browse public samples:

Browse the historical ISOGG tree and get its matching YTree node:

Information publication:


High Precision Mutation Analysis

  • No pre-defined mask regions; every region of the Y chromosome is analysed. DNAChron can find small stable stretches inside many traditionally discarded regions.
  • The stability of each mutation is verified against the whole Y tree.
  • Supports finding SNPs, INDELs, and MNPs/complex variants.
On average, we find 30%~100% more mutations per haplogroup than other Y trees.
High Precision Age Estimation

  • Thanks to high-precision mutation analysis, the tree is more accurate, and more mutations are available for age estimation.
  • All 18M bases are used for age estimation, including every mutation type: SNP, INDEL, MNP/complex. DNAChron therefore works with a much higher mutation rate than the traditional algorithm.
  • The mutation rate is calculated from each sample's actual coverage, so both low-coverage data and whole-sequence data get a realistic probability distribution for their age estimate.
  • The probability distribution is propagated between haplogroups, so there is no age contradiction between upstream and downstream, and the age difference between haplogroups does not deviate from the number of mutations between them.
  • Each haplogroup's age estimate has an additional independent condition from its upstream haplogroup.
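
To make the idea of a coverage-dependent probability distribution concrete, here is a minimal sketch. It is not DNAChron's algorithm. It assumes a single lineage, a Poisson mutation process, a flat prior on age, and an illustrative rate; the function names and all numbers are hypothetical. It does not model the propagation between upstream and downstream haplogroups described above.

```python
import math


def age_posterior(n_mutations, covered_bases, rate_per_base_year,
                  max_years=20000, step=10):
    """Posterior over age (years) on a grid, under a flat prior."""
    lam = covered_bases * rate_per_base_year  # expected mutations per year
    years = list(range(step, max_years + 1, step))
    # Poisson log-likelihood of the observed count, without the constant log(k!).
    log_w = [n_mutations * math.log(lam * t) - lam * t for t in years]
    peak = max(log_w)
    weights = [math.exp(x - peak) for x in log_w]
    total = sum(weights)
    return years, [w / total for w in weights]


def credible_interval(years, probs, mass=0.95):
    """Equal-tailed interval read off the cumulative distribution."""
    lower_q = (1.0 - mass) / 2.0
    upper_q = 1.0 - lower_q
    cumulative, lower, upper = 0.0, None, None
    for t, p in zip(years, probs):
        cumulative += p
        if lower is None and cumulative >= lower_q:
            lower = t
        if upper is None and cumulative >= upper_q:
            upper = t
            break
    return lower, upper


# Hypothetical sample: 20 mutations seen across 14M covered bases.
years, probs = age_posterior(20, 14_000_000, 8e-10)
mode = years[probs.index(max(probs))]
print("most likely age:", mode, "years")
print("95% interval:", credible_interval(years, probs))
```

Because `covered_bases` is an input, a low-coverage sample and a whole-sequence sample with the same mutation count get different distributions, which is the behaviour the list above describes.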
Mutations

To provide offline, standardized, batch-processing tools for converting and searching mutation names and positions from ybrowse, we created a GitHub repository, dnachronYdb.

  • Download only the delta; there is no need to download hundreds of megabytes of data every time.
  • Not bound by Excel's limit of 1048576 rows, so all mutation info can be searched at once.
  • Uses database indexes to search mutation names and positions efficiently.
  • Standardizes indel mutations: left alignment and unified REF and ALT formats.
  • Adds a naming date for each mutation. Where a mutation has several names, the earliest one can be selected by naming date.
  • Because the data is in database format, you can batch-process mutation name and position conversions programmatically. Please check the processing tools we provide in the GitHub repository dnachronYdb-putils.
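
As an illustration of two points in the list above (indel left alignment and indexed lookup), here is a small sketch using Python's built-in sqlite3 module. The table layout, the toy reference sequence, and the TOY1/TOY2 names are invented for the example and are not the actual dnachronYdb schema or data. The left-alignment routine follows the commonly used trim-and-extend normalization and may differ from what dnachronYdb does internally.

```python
import sqlite3


def left_align(ref_seq, pos, ref, alt):
    """Left-align and trim one variant; pos is 1-based on ref_seq."""
    if ref == alt:
        raise ValueError("REF and ALT are identical")
    changed = True
    while changed:
        changed = False
        # Drop a base shared at the right end of both alleles.
        if ref and alt and ref[-1] == alt[-1]:
            ref, alt = ref[:-1], alt[:-1]
            changed = True
        # If an allele became empty, extend both one base to the left.
        if not ref or not alt:
            if pos == 1:
                raise ValueError("cannot extend past the start of the reference")
            pos -= 1
            base = ref_seq[pos - 1]
            ref, alt = base + ref, base + alt
            changed = True
    # Drop shared leading bases, keeping at least one base per allele.
    while len(ref) > 1 and len(alt) > 1 and ref[0] == alt[0]:
        ref, alt = ref[1:], alt[1:]
        pos += 1
    return pos, ref, alt


TOY_REFERENCE = "GGGCACACACAGGG"  # toy sequence, not real Y-chromosome data

conn = sqlite3.connect(":memory:")
# Hypothetical schema for illustration only.
conn.execute("CREATE TABLE mutation (name TEXT, pos INTEGER, ref TEXT, alt TEXT)")
conn.execute("CREATE INDEX idx_mutation_name ON mutation (name)")
conn.execute("CREATE INDEX idx_mutation_pos ON mutation (pos)")

raw_records = [("TOY1", 8, "CAC", "C"), ("TOY2", 5, "A", "T")]
for name, pos, ref, alt in raw_records:
    conn.execute("INSERT INTO mutation VALUES (?, ?, ?, ?)",
                 (name, *left_align(TOY_REFERENCE, pos, ref, alt)))

# Both lookups can use an index instead of scanning every row.
print(conn.execute("SELECT pos, ref, alt FROM mutation WHERE name = ?",
                   ("TOY1",)).fetchone())
print(conn.execute("SELECT name FROM mutation WHERE pos = ?", (5,)).fetchall())
```

Normalizing before storing means the same indel written at different positions inside a repeat ends up with one canonical position, REF, and ALT, so a position search finds it regardless of how it was originally reported.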
Privacy & Security
DNAChron stores your registration information, tester information, analysis results, and genetic data separately in the cloud, and protects them with reasonable and feasible security measures covering technology, hardware, and management processes.
For the detailed privacy policy, please see the website.
Reference
The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes
Generation of high-resolution a priori Y-chromosome phylogenies using “next-generation” sequencing data
The Y-chromosome point mutation rate in humans
Defining a New Rate Constant for Y-Chromosome SNPs based on Full Sequencing Data
Improved Models of Coalescence Ages of Y-DNA Haplogroups
The study of human Y chromosome variation through ancient DNA
Present-Day DNA Contamination in Ancient DNA Datasets
Computational challenges in the analysis of ancient DNA
 
Examples of new findings in widely known haplogroups, made possible by high-precision mutation analysis:

New Haplogroup between R-Y2568 and R-Y879
New Haplogroup between R-Z280 and R-Z92, R-CTS1211
New Haplogroup between R-Z214 and R-CTS7870, R-Y151131
New Haplogroup between J-PF7321 and J-CTS1989
New Haplogroup between I-Z60 and I-Z140, I-CTS7362, I-FGC23806
New Haplogroup between E-Z5018 and E-L17, E-S2979
New Haplogroup between E-Y8829 and E-A1152, E-Z5011
New Haplogroup between E-Y20406 and E-Y72713, E-Y142136

 
You just upload your y700 bam file and then just wait?
 
I'll upload my dad's WGS later, why not.


Edit:

Can someone link the website?
 
I need 20 posts before I can paste links and images. You can search for DNAChron on Google or Bing.

A y700 bam file or any other WGS bam/cram/fastq file is acceptable :bigsmile:.
 
Well, I uploaded my DNA file. What I can see at the moment is E-Z38456.
 
Thank you! The haplogroup will keep updating as other users upload.
 
Recent new findings in widely known haplogroups

BY3642 between R-Z193 and R-Y16335, R-Y44783

13298002 T->TA between J-CTS6152 and J-BY74, J-Z2293
 
I uploaded my dad's WGS but nothing too exciting, R-BY175662.
 
