Archiver > GENEALOGY-DNA > 2010-02 > 1267301131

From: "Lancaster-Boon" <>
Subject: [DNA] Call to participants in ALL geographicandhaplogroupprojects:fill in your ancestry
Date: Sat, 27 Feb 2010 21:05:31 +0100

Hi Kirsten

Splitting up the data is one possibility I guess we have to constantly keep
in mind, but I understand that in the past this could not be sustained.
Maybe we can set up a control somehow to do this better and compartmentalize
the work somehow.

Actually it would be easy to do this if the database were not in itself so
big. Pasting 4400 individuals into a spreadsheet and then playing around
using sort and filter does not sound like a good idea. It gets mentioned to
me every now and then that big projects require a knowledge of SQL. Gulp.

Even then I think there are many practical challenges I hope FT DNA is
looking at. If you want to make your own big database then things like
furthest back ancestor, SNP tests, etc are not I think available in any
tabular form for admins? (CSV or otherwise.) It is quite often on the E-M35
project that people ask what the latest SNP results are for a new SNP for
example, but to answer such a questions means being able to sort out not
only positives but also negatives. But for many such jobs the only way to do
is to manually open each personal account and go through it.

Or are there tricks I have not learnt?

Best Regards

Subject: Re: [DNA] Call to participants in ALL
geographicandhaplogroupprojects: fill in your ancestry
Date: Sat, 27 Feb 2010 18:02:02 +0000 (UTC)
In-Reply-To: <DD1919274EA94023B7ED263A0784AF4C@PC>

I do see a value to including data from men with brick walls or limited
information about the places of origin for their most distant known
ancestors in some databases. One example would be members of small clades
that are tied to specific geographic areas, but whose signatures would be
hard to recognize if you cut your sample size down to those few with paper
trails to specific counties. I think that others could come up with other
examples of ways that including data from men lacking specific information
about their origins in databases for men whose lines may originate from a
particular geographical area would be beneficial.

Of course you wouldn't have to do that with the British Isles project, and
you might not have the time or the support from other administrators needed
to handle a project of the size that a very open British Isles project would
be. It really would be nice to find a way to make such a project happen, or
to at least have a couple of projects whose databases could be combined by
researchers interested in putting the haplotype puzzle pieces from men with
and without information about origins in specific British Isles counties

This thread: