Admixtools admixtools2 TUTORIAL for WINDOWS.

Celtion · Nov 26, 2024

Jalisciense said:
it was that package ^

Or did I install it wrong?

Then I copied the executables in EIG-master bin folder to the bin folder:

Then I convert mydata plink files to PACKEDANCESTRYMAP:

Code:

jalisciense@vbox:~> cd bin jalisciense@vbox:~/bin> convertf -p par.PACKEDPED.PACKEDANCESTRYMAP parameter file: par.PACKEDPED.PACKEDANCESTRYMAP genotypename: mydata.bed snpname: mydata.bim indivname: mydata.fam outputformat: PACKEDANCESTRYMAP genotypeoutname: mydata.packedancestrymapgeno snpoutname: mydata.snp indivoutname: mydata.ind familynames: NO ## convertf version: 5000 genetic distance set from physical distance ID001 ignored all individuals set ignore. Likely input problem (col 6) resetting all individual... genotype file processed numvalidind: 1 maxmiss: 1001 packedancestrymap output ##end of convertf run jalisciense@vbox:~/bin>

So I tried to merged them:

Code:

jalisciense@vbox:~> cd bin jalisciense@vbox:~/bin> mergeit -p merge_param.par parameter file: merge_param.par geno1: v62.0_1240k_public.geno snp1: v62.0_1240k_public.snp ind1: v62.0_1240k_public.ind geno2: mydata.packedancestrymapgeno snp2: mydata.snp ind2: mydata.ind genooutfilename: mydata_merged_with_1240k.geno snpoutfilename: mydata_merged_with_1240k.snp indoutfilename: mydata_merged_with_1240k.ind allele funny: rs2679218 T C X X allele funny: rs7418365 A G X X allele funny: rs2495090 C T X X allele funny: rs6686602 G T X X allele funny: rs97922 T G X X allele funny: rs9645433 A G X X allele funny: rs6678619 G A X X allele funny: rs2149116 G A X X allele funny: rs12097924 G T X X allele funny: rs12566647 G A X X allele funny: rs1772782 A G X X Etc etc allele funny: rs etc etc Killed jalisciense@vbox:~/bin>

What did I do wrong?

And answering you questions about my raw data:

AncestryDNA raw data
build 37.1
Forward (+)

Okay your individual file didn’t convert properly. Open the .fam file and change the -9 to a 1 or 2. Then convert it to PAM format again. Then merge with the AADR again.

Jalisciense · Nov 26, 2024

Celtion said:
Okay your individual file didn’t convert properly. Open the .fam file and change the -9 to a 1 or 2. Then convert it to PAM format again. Then merge with the AADR again.

I changed the -9 to 1

FAM001 ID001 0 0 1 1

And I run it:

Code:

jalisciense@vbox:~> cd bin
jalisciense@vbox:~/bin>  convertf -p par.PACKEDPED.PACKEDANCESTRYMAP
parameter file: par.PACKEDPED.PACKEDANCESTRYMAP
genotypename: mydata.bed
snpname: mydata.bim
indivname: mydata.fam
outputformat: PACKEDANCESTRYMAP
genotypeoutname: mydata.packedancestrymapgeno
snpoutname: mydata.snp
indivoutname: mydata.ind
familynames: NO
## convertf version: 5000
genetic distance set from physical distance
genotype file processed
numvalidind:      1  maxmiss:  1001
packedancestrymap output
##end of convertf run
jalisciense@vbox:~/bin>

I made a file: merge_param.par (not .txt) and this was inside:

geno1: v62.0_1240k_public.geno
snp1: v62.0_1240k_public.snp
ind1: v62.0_1240k_public.ind
geno2: mydata.packedancestrymapgeno
snp2: mydata.snp
ind2: mydata.ind
genooutfilename: mydata_merged_with_1240k.geno
snpoutfilename: mydata_merged_with_1240k.snp
indoutfilename: mydata_merged_with_1240k.ind

So I run it:

Code:

jalisciense@vbox:~> cd bin
jalisciense@vbox:~/bin> mergeit -p merge_param.par
parameter file: merge_param.par
geno1: v62.0_1240k_public.geno
snp1: v62.0_1240k_public.snp
ind1: v62.0_1240k_public.ind
geno2: mydata.packedancestrymapgeno
snp2: mydata.snp
ind2: mydata.ind
genooutfilename: mydata_merged_with_1240k.geno
snpoutfilename: mydata_merged_with_1240k.snp
indoutfilename: mydata_merged_with_1240k.ind
allele funny: rs2679218 T C X X
allele funny: rs7418365 A G X X
allele funny: rs2495090 C T X X
allele funny: rs6686602 G T X X
allele funny: rs97922 T G X X
allele funny: rs9645433 A G X X
allele funny: rs6678619 G A X X
allele funny: rs2149116 G A X X
allele funny: rs12097924 G T X X
allele funny: rs12566647 G A X X
allele funny: rs1772782 A G X X

Etc etc allele funny: rs etc etc

allele funny: rs9785861 G A X X
Killed
jalisciense@vbox:~/bin>

allele funny and killed message again.

Celtion · Nov 26, 2024

Jalisciense said:

I changed the -9 to 1

FAM001 ID001 0 0 1 1

And I run it:

Code:

jalisciense@vbox:~> cd bin
jalisciense@vbox:~/bin>  convertf -p par.PACKEDPED.PACKEDANCESTRYMAP
parameter file: par.PACKEDPED.PACKEDANCESTRYMAP
genotypename: mydata.bed
snpname: mydata.bim
indivname: mydata.fam
outputformat: PACKEDANCESTRYMAP
genotypeoutname: mydata.packedancestrymapgeno
snpoutname: mydata.snp
indivoutname: mydata.ind
familynames: NO
## convertf version: 5000
genetic distance set from physical distance
genotype file processed
numvalidind:      1  maxmiss:  1001
packedancestrymap output
##end of convertf run
jalisciense@vbox:~/bin>

I made a file: merge_param.par (not .txt) and this was inside:

geno1: v62.0_1240k_public.geno
snp1: v62.0_1240k_public.snp
ind1: v62.0_1240k_public.ind
geno2: mydata.packedancestrymapgeno
snp2: mydata.snp
ind2: mydata.ind
genooutfilename: mydata_merged_with_1240k.geno
snpoutfilename: mydata_merged_with_1240k.snp
indoutfilename: mydata_merged_with_1240k.ind

So I run it:

Code:

jalisciense@vbox:~> cd bin
jalisciense@vbox:~/bin> mergeit -p merge_param.par
parameter file: merge_param.par
geno1: v62.0_1240k_public.geno
snp1: v62.0_1240k_public.snp
ind1: v62.0_1240k_public.ind
geno2: mydata.packedancestrymapgeno
snp2: mydata.snp
ind2: mydata.ind
genooutfilename: mydata_merged_with_1240k.geno
snpoutfilename: mydata_merged_with_1240k.snp
indoutfilename: mydata_merged_with_1240k.ind
allele funny: rs2679218 T C X X
allele funny: rs7418365 A G X X
allele funny: rs2495090 C T X X
allele funny: rs6686602 G T X X
allele funny: rs97922 T G X X
allele funny: rs9645433 A G X X
allele funny: rs6678619 G A X X
allele funny: rs2149116 G A X X
allele funny: rs12097924 G T X X
allele funny: rs12566647 G A X X
allele funny: rs1772782 A G X X

Etc etc allele funny: rs etc etc

allele funny: rs9785861 G A X X
Killed
jalisciense@vbox:~/bin>

allele funny and killed message again.

Okay try adding the following parameters in the par file and then merge again.
docheck: YES
strandcheck: YES

Also you can keep the PAM files as just .geno suffix. It will know that it’s PAM format.

Jalisciense · Nov 26, 2024

Celtion said:
Okay try adding the following parameters in the par file and then merge again.
docheck: YES
strandcheck: YES

Also you can keep the PAM files as just .geno suffix. It will know that it’s PAM format.

Ok!

Code:

jalisciense@vbox:~> cd bin
jalisciense@vbox:~/bin> mergeit -p merge_param.par
parameter file: merge_param.par
geno1: v62.0_1240k_public.geno
snp1: v62.0_1240k_public.snp
ind1: v62.0_1240k_public.ind
geno2: mydata.geno
snp2: mydata.snp
ind2: mydata.ind
genooutfilename: mydata_merged_with_1240k.geno
snpoutfilename: mydata_merged_with_1240k.snp
indoutfilename: mydata_merged_with_1240k.ind
docheck: YES
strandcheck: YES
allele funny: rs2679218 T C X X
allele funny: rs7418365 A G X X
allele funny: rs2495090 C T X X
allele funny: rs6686602 G T X X
allele funny: rs97922 T G X X
allele funny: rs9645433 A G X X
allele funny: rs6678619 G A X X
allele funny: rs2149116 G A X X
allele funny: rs12097924 G T X X
allele funny: rs12566647 G A X X
allele funny: rs1772782 A G X X
allele funny: rs2391467 A G X X
allele funny: rs2222748 T C X X
allele funny: rs2064901 A G X X
allele funny: rs6694580 G T X X
allele funny: rs1130183 G A X X
allele funny: rs1409814 A G X X
allele funny: rs2181289 T C X X
allele funny: rs1875431 C T X X
allele funny: rs664855 T C X X
allele funny: rs789358 T G X X

etc etc allele funny: rs etc etc etc

allele funny: rs9785861 G A X X
read 1073741824 bytes
read 2147483648 bytes
read 3221225472 bytes
read 4294967296 bytes
read 5368709120 bytes
read 5435121304 bytes
packed geno read OK
end of inpack
packed geno read OK
end of inpack
Killed
jalisciense@vbox:~/bin>

Celtion · Nov 26, 2024

Jalisciense said:

Ok!

Code:

jalisciense@vbox:~> cd bin
jalisciense@vbox:~/bin> mergeit -p merge_param.par
parameter file: merge_param.par
geno1: v62.0_1240k_public.geno
snp1: v62.0_1240k_public.snp
ind1: v62.0_1240k_public.ind
geno2: mydata.geno
snp2: mydata.snp
ind2: mydata.ind
genooutfilename: mydata_merged_with_1240k.geno
snpoutfilename: mydata_merged_with_1240k.snp
indoutfilename: mydata_merged_with_1240k.ind
docheck: YES
strandcheck: YES
allele funny: rs2679218 T C X X
allele funny: rs7418365 A G X X
allele funny: rs2495090 C T X X
allele funny: rs6686602 G T X X
allele funny: rs97922 T G X X
allele funny: rs9645433 A G X X
allele funny: rs6678619 G A X X
allele funny: rs2149116 G A X X
allele funny: rs12097924 G T X X
allele funny: rs12566647 G A X X
allele funny: rs1772782 A G X X
allele funny: rs2391467 A G X X
allele funny: rs2222748 T C X X
allele funny: rs2064901 A G X X
allele funny: rs6694580 G T X X
allele funny: rs1130183 G A X X
allele funny: rs1409814 A G X X
allele funny: rs2181289 T C X X
allele funny: rs1875431 C T X X
allele funny: rs664855 T C X X
allele funny: rs789358 T G X X

etc etc allele funny: rs etc etc etc

allele funny: rs9785861 G A X X
read 1073741824 bytes
read 2147483648 bytes
read 3221225472 bytes
read 4294967296 bytes
read 5368709120 bytes
read 5435121304 bytes
packed geno read OK
end of inpack
packed geno read OK
end of inpack
Killed
jalisciense@vbox:~/bin>

What did you use to convert your Ancestry file to 23andMe format?

Jalisciense · Nov 26, 2024

Celtion said:
What did you use to convert your Ancestry file to 23andMe format?

I downloaded my raw data and strip my AncestryDNA header up to rsid chromosome position allele1 allele 2:

And it stayed that way:

Then I sent it to my virtual machine (Where I have AdmixTools) and I used this command to convert it to 23andMe format:

awk 'BEGIN {FS="\t"};{print $1"\t"$2"\t"$3"\t"$4 $5}' AncestryDNA.txt > AncestryCombined.txt

So the raw data was like this now:

Then I converted it to Plink using this command:

plink --23file AncestryCombined.txt --make-bed --out mydata

And I received mydata.bed, mydata.bim, mydata.fam, mydata.hh, mydata.log files.

The rest of the steps are in this thread where I installed Eigentools and I converted my data files to PACKEDANCESTRYMAP and later trying to merge them with the AADR.

Did I do everything right? Or did I screw up somewhere?

Celtion · Nov 27, 2024

Jalisciense said:
I downloaded my raw data and strip my AncestryDNA header up to rsid chromosome position allele1 allele 2:

And it stayed that way:

Then I sent it to my virtual machine (Where I have AdmixTools) and I used this command to convert it to 23andMe format:

awk 'BEGIN {FS="\t"};{print $1"\t"$2"\t"$3"\t"$4 $5}' AncestryDNA.txt > AncestryCombined.txt

So the raw data was like this now:

Then I converted it to Plink using this command:

plink --23file AncestryCombined.txt --make-bed --out mydata

And I received mydata.bed, mydata.bim, mydata.fam, mydata.hh, mydata.log files.

The rest of the steps are in this thread where I installed Eigentools and I converted my data files to PACKEDANCESTRYMAP and later trying to merge them with the AADR.

Did I do everything right? Or did I screw up somewhere?

To be on the safe side just use this free converter to go from Ancestry to 23andMe format – that’s what I use: https://tendna.com/en/dna-converter

Then convert to Plink again but don’t use the --make-bed flag. The format is this:
plink --23file <dnafile>.txt <familyname> <individualname/ID> --out <dnafile>

Then use convertf to convert to PAM. When you use mergeit put your file as the first input and the AADR second. Also you can add the following to the par file:
outputformat: PACKEDANCESTRYMAP

Jalisciense · Nov 27, 2024

Celtion said:
To be on the safe side just use this free converter to go from Ancestry to 23andMe format – that’s what I use: https://tendna.com/en/dna-converter

Then convert to Plink again but don’t use the --make-bed flag. The format is this:
plink --23file <dnafile>.txt <familyname> <individualname/ID> --out <dnafile>

Then use convertf to convert to PAM. When you use mergeit put your file as the first input and the AADR second. Also you can add the following to the par file:
outputformat: PACKEDANCESTRYMAP

Edit:

I think you mean the command like this?

plink --23file 23andme.txt 001 001--out mydata

Because I run it and I got this:

Code:

jalisciense@vbox:~/bin> plink --23file 23andme.txt 001 001--out mydata


PLINK v1.9.0-b.7.7 64-bit (22 Oct 2024) cog-genomics.org/plink/1.9/


(C) 2005-2024 Shaun Purcell, Christopher Chang GNU General Public License v3


Logging to plink.log.


Options in effect:


  --23file 23andme.txt 001 001--out mydata


Error: Invalid --23file sex parameter (M or 1 = male, F or 2 = female,


I = infer from data, 0 = force missing).


jalisciense@vbox:~/bin>

So I changed to:

plink --23file 23andme.txt 1--out mydata

And I got this:

Code:

jalisciense@vbox:~/bin> plink --23file 23andme.txt 1--out mydata


PLINK v1.9.0-b.7.7 64-bit (22 Oct 2024) cog-genomics.org/plink/1.9/


(C) 2005-2024 Shaun Purcell, Christopher Chang GNU General Public License v3


Logging to plink.log.


Options in effect:


  --23file 23andme.txt 1--out mydata


5886 MB RAM detected; reserving 2943 MB for main workspace.


--23file: plink.bed + plink.bim + plink.fam written.


Inferred sex: male.


jalisciense@vbox:~/bin>

I received 4 files: plink.bed, plink.bim, plink.fam and plink.log (I didn't named "plink", when I received the files automatically were named them like that.

So I open the .fam file and the content was this:

1--out mydata 0 0 1 -9

So it's not correct right?

Celtion · Nov 27, 2024

Jalisciense said:
Ok I understand everything you wrote except for this, what do you mean?

<familyname> <individualname/ID>

What I have to write there? My real name and surname or what lol

So for example the command would be like this?:

plink --23file 23andme.txt Smith John --out mydata

yes or whatever you want to call yourself. These will become the first two columns of your sample info in the .fam file.

Jalisciense · Nov 27, 2024

Celtion said:
yes or whatever you want to call yourself. These will become the first two columns of your sample info

I edited my post.

But I did what you told me and it didn't work with this command:

plink --23file 23andme.txt smith john--out mydata

I run it:

Code:

jalisciense@vbox:~> cd bin

jalisciense@vbox:~/bin> plink --23file 23andme.txt smith john--out mydata
PLINK v1.9.0-b.7.7 64-bit (22 Oct 2024) cog-genomics.org/plink/1.9/
(C) 2005-2024 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to plink.log.
Options in effect:
  --23file 23andme.txt smith john--out mydata

Error: Invalid --23file sex parameter (M or 1 = male, F or 2 = female,
I = infer from data, 0 = force missing).
jalisciense@vbox:~/bin>

So I changed it to 001 and 001, this command:

plink --23file 23andme.txt 001 001--out mydata

I run it and I got this:

Code:

jalisciense@vbox:~/bin> plink --23file 23andme.txt 001 001--out mydata
PLINK v1.9.0-b.7.7 64-bit (22 Oct 2024) cog-genomics.org/plink/1.9/
(C) 2005-2024 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to plink.log.
Options in effect:
  --23file 23andme.txt 001 001--out mydata

Error: Invalid --23file sex parameter (M or 1 = male, F or 2 = female,
I = infer from data, 0 = force missing).
jalisciense@vbox:~/bin>

So I changed to:

plink --23file 23andme.txt 1--out mydata

And I got this:

Code:

jalisciense@vbox:~/bin> plink --23file 23andme.txt 1--out mydata
PLINK v1.9.0-b.7.7 64-bit (22 Oct 2024) cog-genomics.org/plink/1.9/
(C) 2005-2024 Shaun Purcell, Christopher Chang GNU General Public License v3
Logging to plink.log.
Options in effect:
  --23file 23andme.txt 1--out mydata

5886 MB RAM detected; reserving 2943 MB for main workspace.
--23file: plink.bed + plink.bim + plink.fam written.
Inferred sex: male.
jalisciense@vbox:~/bin>

I received 4 files (Not 5): plink.bed, plink.bim, plink.fam and plink.log (I didn't named "plink", when I received the files automatically were named them like that).

So I open the .fam file and the content was this:

1--out mydata 0 0 1 -9

So it's not correct right?

Celtion · Nov 27, 2024

Jalisciense said:
I edited my post.

But I did what you told me and it didn't work with this command:

plink --23file 23andme.txt smith john--out mydata

I run it:

Code:

jalisciense@vbox:~> cd bin jalisciense@vbox:~/bin> plink --23file 23andme.txt smith john--out mydata PLINK v1.9.0-b.7.7 64-bit (22 Oct 2024) cog-genomics.org/plink/1.9/ (C) 2005-2024 Shaun Purcell, Christopher Chang GNU General Public License v3 Logging to plink.log. Options in effect: --23file 23andme.txt smith john--out mydata Error: Invalid --23file sex parameter (M or 1 = male, F or 2 = female, I = infer from data, 0 = force missing). jalisciense@vbox:~/bin>

So I changed it to 001 and 001, this command:

plink --23file 23andme.txt 001 001--out mydata

I run it and I got this:

Code:

jalisciense@vbox:~/bin> plink --23file 23andme.txt 001 001--out mydata PLINK v1.9.0-b.7.7 64-bit (22 Oct 2024) cog-genomics.org/plink/1.9/ (C) 2005-2024 Shaun Purcell, Christopher Chang GNU General Public License v3 Logging to plink.log. Options in effect: --23file 23andme.txt 001 001--out mydata Error: Invalid --23file sex parameter (M or 1 = male, F or 2 = female, I = infer from data, 0 = force missing). jalisciense@vbox:~/bin>

So I changed to:

plink --23file 23andme.txt 1--out mydata

And I got this:

Code:

jalisciense@vbox:~/bin> plink --23file 23andme.txt 1--out mydata PLINK v1.9.0-b.7.7 64-bit (22 Oct 2024) cog-genomics.org/plink/1.9/ (C) 2005-2024 Shaun Purcell, Christopher Chang GNU General Public License v3 Logging to plink.log. Options in effect: --23file 23andme.txt 1--out mydata 5886 MB RAM detected; reserving 2943 MB for main workspace. --23file: plink.bed + plink.bim + plink.fam written. Inferred sex: male. jalisciense@vbox:~/bin>

I received 4 files (Not 5): plink.bed, plink.bim, plink.fam and plink.log (I didn't named "plink", when I received the files automatically were named them like that).

So I open the .fam file and the content was this:

1--out mydata 0 0 1 -9

So it's not correct right?

You typed it in wrong. Make sure there’s a space before the --out flag.

Jalisciense · Nov 27, 2024

Celtion said:
You typed it in wrong. Make sure there’s a space before the --out flag.

ok, so here we go:

I used the link of free DNA file converter: https://tendna.com/en/dna-converter

The option of raw data was from AncestryDNA to 23andMe format:

Then I strip my AncestryDNA header up to rsid chromosome position allele1 allele 2:

And now it looked like this:

(I sent it to my virtual machine because I have there Admixtools, Plink and Eigentools).

I convert my 23andmelike file to Plink using this command:

plink --23file 23andmelike.txt Smith John --out mydata

(My name is not Smith John, I just write as an example).

Then I run it:

So I got 4 files: mydata.bed, mydata.bim, mydata.fam and mydata.log

I opened the mydata.fam and I changed manually the -9 to 1:

Smith John 0 0 1 -9 -> Smith John 0 0 1 1

Then I converted Plink format files to PACKEDANCESTRYMAP format, but first I used to have familynames: NO in the par.PACKEDPED.PACKEDANCESTRYMAP file but I changed to familynames: YES

It looks like this:

genotypename: mydata.bed
snpname: mydata.bim
indivname: mydata.fam
outputformat: PACKEDANCESTRYMAP
genotypeoutname: mydata.geno
snpoutname: mydata.snp
indivoutname: mydata.ind
familynames: YES

I run it with the command:

convertf -p par.PACKEDPED.PACKEDANCESTRYMAP

I received 3 new files: mydata.geno, mydata.ind and mydata.snp

Now I tried to merged them with the AADR files (Both are in the PACKEDANCESTRYMAP format), but adding 3 parameters in the merge_param.par file, besides the geno1, snp1 and ind1 were mydata files and geno2, snp2 and ind2 were the AADR ones.

Like this:

geno1: mydata.geno
snp1: mydata.snp
ind1: mydata.ind
geno2: v62.0_1240k_public.geno
snp2: v62.0_1240k_public.snp
ind2: v62.0_1240k_public.ind
genooutfilename: mydata_merged_with_1240k.geno
snpoutfilename: mydata_merged_with_1240k.snp
indoutfilename: mydata_merged_with_1240k.ind

outputformat: PACKEDANCESTRYMAP
docheck: YES
strandcheck: YES

So I run this command:

mergeit -p merge_param.par

So something went wrong again, but now the "allele funny" did not appear, so this time what did it happen?

Celtion · Nov 27, 2024

Jalisciense said:
ok, so here we go:

I used the link of free DNA file converter: https://tendna.com/en/dna-converter

The option of raw data was from AncestryDNA to 23andMe format:

Then I strip my AncestryDNA header up to rsid chromosome position allele1 allele 2:

And now it looked like this:

(I sent it to my virtual machine because I have there Admixtools, Plink and Eigentools).

I convert my 23andmelike file to Plink using this command:

plink --23file 23andmelike.txt Smith John --out mydata

(My name is not Smith John, I just write as an example).

Then I run it:

So I got 4 files: mydata.bed, mydata.bim, mydata.fam and mydata.log

I opened the mydata.fam and I changed manually the -9 to 1:

Smith John 0 0 1 -9 -> Smith John 0 0 1 1

Then I converted Plink format files to PACKEDANCESTRYMAP format, but first I used to have familynames: NO in the par.PACKEDPED.PACKEDANCESTRYMAP file but I changed to familynames: YES

It looks like this:

genotypename: mydata.bed
snpname: mydata.bim
indivname: mydata.fam
outputformat: PACKEDANCESTRYMAP
genotypeoutname: mydata.geno
snpoutname: mydata.snp
indivoutname: mydata.ind
familynames: YES

I run it with the command:

convertf -p par.PACKEDPED.PACKEDANCESTRYMAP

I received 3 new files: mydata.geno, mydata.ind and mydata.snp

Now I tried to merged them with the AADR files (Both are in the PACKEDANCESTRYMAP format), but adding 3 parameters in the merge_param.par file, besides the geno1, snp1 and ind1 were mydata files and geno2, snp2 and ind2 were the AADR ones.

Like this:

geno1: mydata.geno
snp1: mydata.snp
ind1: mydata.ind
geno2: v62.0_1240k_public.geno
snp2: v62.0_1240k_public.snp
ind2: v62.0_1240k_public.ind
genooutfilename: mydata_merged_with_1240k.geno
snpoutfilename: mydata_merged_with_1240k.snp
indoutfilename: mydata_merged_with_1240k.ind

outputformat: PACKEDANCESTRYMAP
docheck: YES
strandcheck: YES

So I run this command:

mergeit -p merge_param.par

So something went wrong again, but now the "allele funny" did not appear, so this time what did it happen?

Reconvert the ancestry to 23andMe at the website but this time don't remove the header! mergeit is programmed to fail if it detects changes that have been made to the file. Then proceed as before.

Jalisciense · Nov 27, 2024

Celtion said:
Reconvert the ancestry to 23andMe at the website but this time don't remove the header! mergeit is programmed to fail if it detects changes that have been made to the file. Then proceed as before.

I reconverted it, I did not modify anything (I even did not change the name of the file).

I followed all the steps and when I tried to merged, appeared like this:

Celtion · Nov 27, 2024

Jalisciense said:
I reconverted it, I did not modify anything (I even did not change the name of the file).

I followed all the steps and when I tried to merged, appeared like this:

okay, add the following parameters to the mergeit par file:
hashcheck: NO
allowdups: YES

See how that goes.

Jalisciense · Nov 27, 2024

Celtion said:
okay, add the following parameters to the mergeit par file:
hashcheck: NO
allowdups: YES

See how that goes.

Ok, adding the 2 parameters you recommended:

Celtion · Nov 27, 2024

Jalisciense said:
Ok, adding the 2 parameters you recommended:

okay lets go back to basics and get rid of all the optional parameters and stick with the original 9 and see what happens.

Jalisciense · Nov 27, 2024

Celtion said:
okay lets go back to basics and get rid of all the optional parameters and stick with the original 9 and see what happens.

Hey bro, a question, when you merged your data with AADR, did you use the 9 original parameters? Or did you add more?

Celtion · Nov 27, 2024

Jalisciense said:
Hey bro, a question, when you merged your data with AADR, did you use the 9 original parameters? Or did you add more?

I only added the outputformat parameter to the main 9 however I have always had it located between the 2nd fileset and the merged fileset – in other words on the 7th line. Also I didn't have any empty lines, so it was just 10 lines with no line spaces in between.
You could try that and if it fails again, try converting your 23andMe file to Plink again but add the --keep-allele-order flag and redo the rest and see if that helps.

Other than that I'm running out of ideas. Some of these issues can be a needle in a haystack. Originally I couldn't get Eigensoft to compile on my Mac until I found a post where someone showed the code that needed editing.

If all fails you might have to convertf the AADR to Plink and then merge in Plink. I found the v62.0 to be more stable than previous releases so I was able to merge in Plink quite okay however a longer procedure.

Jovialis · Nov 27, 2024

Now THAT's what I call a PCA:

https://twitter.com/x/status/1861801542958911684

Admixtools admixtools2 TUTORIAL for WINDOWS.

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Regular Member

Advisor