Thursday, December 30, 2021

Antenati's New Site Design - is it Actually Better?

Not too long ago, the Italian Archives website, Antenati, or "Ancestors Portal" got a face lift. At first, everyone raved about what an improvement it was, and admittedly, the ability to find and navigate to the records you're looking for has been a great improvement. Unfortunately, it has come at the cost of the Archives no longer supplying an inherent way to download full resolution images, which means we can't save copies of the records for our personal reference. We can take a screenshot, but to get the whole document, it will be too small to read. And if we zoom in to take a screenshot, we won't get the full document.

There is a way around this - but it's basically a hack, and who knows if it will remain available forever. It's also complicated and includes several steps involving the html code. But if you're brave enough, here's how to do it:

Step 1: Navigate to the image you wish to download, and click the icon with 3 horizontal lines located in the upper left corner of the image viewer window (see screenshot below, the icon is highlighted in yellow, click to enlarge).

Step 2: This will bring up a side bar on the left with information. Note the page number listed here (highlighted in yellow in screenshot below), because you'll need that later.

Step 3: Scroll down the side bar to the bottom where you'll see a link just below where it says "IIIF manifest". Click the link (highlighted in yellow in the screenshot below).

Step 4: Here's where it gets tricky. The link opens a page with a bunch of html coding. Different browsers seem to display it differently - if you're lucky, it will be organized with nested lines and different colors, making it easier to find what you're looking for, and the URLs will be clickable links. If you're unlucky like me, you'll see a big long block of text/coding with no links, no colors (shown below). What you're look for first is the page number you took note of in step 2. In the code, it will say "label":"pag. 31" (or whichever page number you're looking for). If you're having trouble finding it, you can use your browser's "Find" or "Find in Page" option to search for it (the screenshot below shows the page number 31 highlighted because I searched for it).

Step 5: Look just above your page number in the code for a URL that looks like this: (URL is highlighted in grey and shows relation to the page number in the screenshot below) - the part that says "wrZgxjz" in my URL will be different for you. That's okay, that's what you want. That's the specific image code you're looking for. Copy and paste the whole URL (or click on it if it's clickable) into a new browser tab.

Step 6: If you're unlucky like me and the URL you copied and pasted includes duplicate slashes so you're getting a "Page not found" result, remove the duplicate slashes. The URL should look like this:, not like this: https:\/\/\/iiif\/2\/wrZgxjz\/full\/full\/0\/default.jpg or like this: If it's annoying to delete all those extra slashes every time, you can always just bookmark the proper URL and then just copy and paste the image code into the URL.

Step 7: Once you get the correct image to load, you can right click it and save the full resolution image.

Although the new site might be faster and easier to navigate, the inability to save crucial documents (which you'd think was the entire purpose of the site) is a huge step backwards. This hack is cumbersome, but for now, it's the only option. Good luck.

Saturday, December 18, 2021

Outlander and the Development of Surnames

For the record, I have never read the Outlander book series, but I have watched the TV show, and as a genealogist, there was one scene in particular about surnames that struck me as hugely unrealistic. It was the suggestion that a somewhat abandoned child never had a surname. There's a few reasons why this seems rather ridiculous to me.

This isn't really a spoiler, because it's not really a scene that's crucial to the plot, though I guess it's relevant to character development, I still don't consider it very important but other's might, so you've been warned, read at your own risk.

In the TV show, our male lead character, Jamie, comes across a young pickpocket, Claudel, while in Paris, who never knew who his parents were (his father was unknown, and his mother was probably one of the prostitutes at the brothel where he lives, but he's never been told which one). They have a conversation in which they both agree Claudel isn't a very good name, suggesting that it isn't a very strong or masculine name. Jamie recommends the name Fergus instead, and Claudel, now Fergus, agrees, and they return to Scotland with Fergus as Jamie's foster child. Fast forward to when Fergus is all grown up, now a young man marrying his sweetheart (who happens to be Jamie's step-daughter - it's best not to examine that too closely), and the officiant asks what his last name is. Everyone pauses as they remember he doesn't have a last name, before Jamie graciously steps in and gives Fergus his own surname, Fraser. It was a touching acknowledgement of their father/son relationship. While it's maybe understandable Fergus may not have been given a surname at birth (though even this is a stretch, more on that later), it doesn't make much sense that by this point in his life, he would not have developed an informal surname that he likely would have used on a marriage record.

Fergus gets married with Jamie's surname

Remember, at this point in history, there were no birth certificates and forms of identification. There were parish records, so there would have been a baptism record, but people didn't keep or travel with copies of their baptism record for identification. Names were more fluid and the concept of an official or legal name hadn't really developed yet. So even though no one gave him a surname at birth doesn't mean one wouldn't have developed over time. Claudel was probably a fairly common name in France at the time, and Paris was a big city, so there were likely other young boys named Claudel, and people would have needed a way to identify or distinguish them. If one of them didn't have a last name, they would have used a description, which would have then naturally been shortened overtime into a surname. Especially as Claudel/Fergus was a bit of a troublemaker, certainly, people would have been talking about him periodically, and needed a way to identify him.

The same would have happened in Scotland, if it never had a chance to happen in Paris, or if the names didn't follow him from France. There was likely more than one Fergus in the area of Scotland they lived in, so people would have identified him in the most obvious ways, probably either something like, "Jamie's ward", "Jamie's foster child", or "the French boy." Overtime, they likely would have gotten shortened to just "Ward", "Foster", or "French," and all three of them could have even been in use at the same time. Therefore, when asked what his surname was, he probably would have picked whichever one of the three he liked better, or was more common. When you think about it, these three names are all very real surnames in use today, and they sometimes come from situations exactly like this one. 

But that fact doesn't have to have ruined the moment. A name like "Ward" or "Foster" kind of identifies one's undesirable origins (and a name like French among a bunch of Scots kind of does too), so it could have been a source of pain or embarrassment for Fergus, and Jamie still could have stepped in and said, "No, it's not Ward, it's Fraser."

Also, let's not forget how easy it was to change or make up a new name. Again, there were no birth certificates and remember how Claudel simply changed his name to Fergus just by deciding that's what he wanted to be called now? So, if he's able to just make up a new first name, there's no reason he couldn't have just made up a surname at the same time. I'm not saying he definitely would have done that, just that if his lack of a surname was a source of embarrassment for him, he could have just given himself a surname, and picked whatever he wanted. This idea that he could change his first name on a whim, but oh no, he's stuck without a surname his whole life until Jamie steps in to save the day, is a little bit silly. 

And I'll even go so far as to say the fact that he didn't have a surname at birth was silly to begin with. Children born out of wedlock were typically given their mother's surname. Fergus didn't know who his mother was, which is weird to begin with, because she's supposedly the reason he was living at the brothel (so was he even actually "abandoned" to begin with if he was living with his mother?). I've been told that he had no surname because "nobody cared enough about him to give him a surname". But what about the fact that someone cared enough about him to let him live at the brothel that his mother worked at? Someone cared enough about him to feed and clothe him, a financial burden to them, when they didn't have to, when they could have dropped him off at an orphanage or foundling hospital (the first of which opened in Paris in 1670), which would have taken him in, and given him a surname. Foundlings were a common part of history, and they were given surnames by this point in time because by the 18th century, surnames had long been in full use in France and Western Europe. But that's not what happened. He was kept at the brothel, supposedly because someone wanted him there, probably his mother, so why would she not identify herself and give him her surname? Even if she died when he was an infant, someone else at the brothel must have cared enough to keep him there even when they didn't have to, and that person surely would have known who his mother was and told him, and therefore he would have used her name. 

So it just doesn't make any sense that even an abandoned child in 18th century Western Europe would not have a surname to begin with, and that even if for some reason he didn't, that one wouldn't have developed out of a nickname over the course of his young life. 

Friday, October 8, 2021

What was Immigration Like in History?

Part of understanding our ancestor's journey is understanding the laws and regulations at the time. The topic of immigration laws and restrictions, and when they began is one that comes up frequently. There's a very good Timeline of Major US Immigration Laws from 1790 - 2006, but it also includes a lot of laws on naturalization, deportation, and changes to enforcements of these laws. So, here's my breakdown of the most important parts:

The first federal immigration act to restrict who could come into the country was the Page Act of 1875, which prohibited the entry of Chinese women, which is less racist than it sounds, it was an attempt to reduce the amount of Chinese women being trafficked into the country for prostitution. In 1870, roughly half of Chinese women in the US were prostitutes, keeping in mind many of those not recorded as prostitutes were children, so it's probably safe to say most adult Chinese women in the US were prostitutes.

It was shortly followed by the much more significant Immigration Act of 1882 and the Chinese Exclusion Act. The Immigration Act banned anyone considered a "convict, lunatic, idiot, or person unable to take care of himself or herself without becoming a public charge." You may think that would only affect the mentally or physically disabled, or sick, but it could also include pregnant or single women considered to not have financial support from a man, like a father, brother, husband, etc. It also put a tax of 50 cents on each incoming foreigner, to be paid by the ship's owner. I imagine this may have driven up the cost of passenger tickets, making it harder (but not impossible) for the most destitute of people to immigrate, though I can't say that was the intention - the intention was merely to raise funds for the regulation of immigration.

The Chinese Exclusion Act was pretty much as racist as it sounds, it suspended immigration of Chinese laborers for 10 years. Over the next 20-some years, there continued to be additional laws that further restricted Chinese immigration, and allowed for easier deportation of existing Chinese residents, such as the 1888 Scott Act, which prohibited lawful Chinese residences from returning to the US if they left. Also, the 1892 Act to Prohibit the Coming of Chinese Persons into the United States aka the "Geary Act", which required all Chinese citizens living in the US to obtain certificates proving their lawful residence, and any Chinese person found unlawfully living in the US, instead of being deported, could be imprisoned and sentenced to one year of hard labor. Fortunately, the hard labor part was deemed unconstitutional by the Supreme Court in 1896 under the 5th Amendment, but it was probably a small win for the Chinese who continued to be harshly discriminated against.

Also worth noting is that although there was never a law with specifics, from 1892 to 1924, the determination of who was likely to become a public charge was left to the discretion of immigration officers, who typically required immigrants have a certain amount of money at the time of immigration. The amounts varied by the immigration officer, but if you did not meet their requirement, you could be deported. How unfair to not have a set national limit on this so immigrants were prepared and could save up enough money. However, there was a backup option if you didn't have enough money: there were immigrant aid societies who would sometimes cover the missing required funds. Some people refer to this as a "sponsorship" but sponsors as they are today (someone who promises to support an immigrant and be financially responsible for them until they get a job and can support themselves) were not required until during the Great Depression in the 1930s.

In 1917, the harsh laws restricting Chinese immigration were extended to essentially most of Asia and the Middle East with the 1917 Immigration Act. It also prevented immigration of anarchists, persons previously deported, and all individuals over 16 who are deemed "physically capable of reading" but who cannot read (of any origin).

But basically, if you were literate, not a convict, not likely to become a public charge, and not Asian, there were no restrictions on your entry up to this point. That changed starting in 1921, and became even worse in 1924.

The 1921 Emergency Quota Act was the first to set "quotas" based on nationality, meaning it limited the number of immigrants from each foreign nation allowed in. The limit was set to 3% of the number of existing residences from those nations based on the 1910 census. So if there were 1,360,000 people on the 1910 census who were born in Italy, only 3% of that, or 40,800 Italians would be allowed to immigrate per year. This significantly reduced the amount of overall immigrants into the country.

In 1924, the quota was reduced to a mere 2% with the 1924 National Origins Act, and more than that, it was based on the 1890 census instead of the 1910 census. It may seem like an odd choice to go with an older census year, as it would be less up to date, but it was a strategic, and arguably discriminatory decision, as it restricted southern and eastern Europeans more than others. Most southern and eastern Europeans immigrated after the 1890s, so basing the quota on the 1890 census meant their numbers were smaller, making their quota smaller. This was the first act to discriminate against certain Europeans (good thing all my Italian ancestors were here by 1914). There were exceptions: students, citizens of Western Hemisphere nations, people of certain occupations, and wives and children of US citizens were all exempt. The act also for the first time required visas be obtained abroad before entry.

Granted, in 1927 the discriminatory selection of the 1890 census was changed to the 1920 census. However, you can see the dramatic drop in immigration due to these two acts in the historical stats. In 1910-1919, the number of people immigrating from Europe was 4,985,411, but in 1920-1929, there were only 2,560,340. By the 1930s, there were only 444,399 immigrants from Europe. The Golden Age of Immigration was over.

The next major immigration changes came in the 1940s, when they actually allowed for more immigration due to the war. During the war, Mexican temporary agricultural workers were allowed in with the 1942 Bracero Agreement, and following the war, the War Brides Act (1945) accepted foreign wives and children of US soldiers into the country. Finally, the Displaced Persons Act of 1948 allowed for 200,000 people displaced from their homelands by Nazi persecution into the US, which actually doesn't seem like a huge number in comparison to the millions that were immigrating prior to 1921. 

In 1965, regulations shifted from the quota based system to one of united families. The 1965 Immigration and Nationality Act based admittance on immigrants relationship to a US citizen or lawful permanent resident family member or US employer. However, caps were placed on the total number of immigrants admitted each year in most family and employer based categories. Additionally, a limit of 120,000 was placed on the total number of permanent residents admitted from the Western Hemisphere.

There were also a number of refugee acts in the 1950s, 1960s, and 1970s, and I could go on up to modern times, but it becomes more and more complicated, and this is only a look at the history of immigration laws and regulation. Hopefully, it's helped you have a good understanding of your ancestor's experience in coming to the US, depending on the time period.


Sunday, September 19, 2021

How to Group Your DNA Matches to Help Break Down Brick Walls

How do you break down a brick wall with DNA? It's what everyone wants to know - after all, what is the point of getting a DNA test if the ethnicity report is unreliable? Everyone says the true value of the test is in your DNA matches, but how do you utilize them to actually be useful in your research? To break down brick walls? To do what paper research couldn't?

This sort of ties in with my instructions on how to find unknown biological ancestors with DNA, though that was targeted more at NPE or adoption situations. However, the same basic process and workflow can be applied to breaking down brick walls. In the past, I've detailed specific cases where I've used DNA to break down a brick wall, but some of them are a little unique - every situation might be a little different, and therefore might require a bit of a different process. But here's the basics. 

In my post about finding unknown biological ancestors, in Step 1, it says, "Look for your closest DNA match that you can't identify as being from another known branch of your tree."

But wait - how do we even get to the point of finding a match you can't identify? You do that by identifying and grouping as many matches as you can. This is how my workflow goes, it works best for me, your mileage may vary, but in my experience, this is how most people do it in some way or form. Some maybe use a spreadsheet and the "Leeds Method", but ultimately, it's just a matter of grouping your matches by what branch of your tree they belong to, and since AncestryDNA have a built in grouping tool, I find that works best for me.

Grouping your matches.

Step 1: Create a group for each "branch" of your tree. Which branches? I recommend a group for each of your sixteen 2nd great grandparents, unless any of those 2nd great grandparents were from the same specific location, or endogamous population, because they will be difficult to tell apart. For example, my 2nd great grandparents who both came from the same tiny town in Italy called Monteroduni got grouped together because I have no other branches from there, and since the town is so endogamous, it would be difficult to always tell them apart. So I just have one group for "Monteroduni". Don't group by broader locations, like country. I did that by grouping my other 2nd great grandparents together because they were both from Norway, but now I regret that because they came from totally different parts of Norway, so there's no endogamy between them. So although I recommend a group for each 2nd great grandparent, depending on your ancestry, you may want to sometimes group them differently. 

16 groups does mean that it will fill up a lot of your available groups, AncestryDNA only allows you a maximum of 24, so you will only have 8 groups left to do with whatever you want. So like I say, you may want to group them differently, but this is what worked best for me.

Step 2: Start at the top of your match list and work your way down. Do you recognize your top match? Or can you see from their tree (if they have one) what ancestor you share? Is there a ThruLines/common ancestor hint for them that you can verify? If you already know the match or can identify how you're related to them, mark the branch you share by adding them to a group you've created for that branch. Do not assume a shared surname alone is the source of your shared DNA, it must be an actual common ancestor.

You may also want to add a note of your common ancestors, so you can see who they are more easily, and also so you know there's identified common ancestors (though I also have a group for MRCA - matches that have identified a most recent common ancestor).

My top matches are all my Italian cousins, you can see how
I've grouped them and added our MRCA to notes

Step 3: Do the same for the next match, and the next - keep going until you can't identify a match. When that happens, look at your Shared Matches with that person. Are any of them the people you've already identified with a common ancestor? If so, they are likely also from the same branch (especially if there's more than one match they share from the same ancestor/branch), so add them to that same group. 

I don't know my MRCA with Bettye because she hasn't added a tree,
but I can tell she's from my Smith branch because she matches
several people who are confirmed Smith descendants

If they have a tree, even a tiny one, build on it until you can find the connection to the branch you know they are likely from (focus on lines that come from the same/nearby location). If you can't find a common ancestor, that's okay, leave them in that group and you can come back to them another time.

Step 4: Keep doing this, ideally for all your estimated 4th cousins and closer (20+ cM). That's a lot, I know (I currently have 1,048 matches that share 20+ cM with me). It takes time, it's a lot of work, but in the end you'll wind up with 3 types of matches: those with identified common ancestors, those who likely come from an identified branch, and those you have no clue how you're related, not even a potential branch.

What to do with these groups? 

This is where there will be some overlap in my instructions on finding an unknown biological ancestor. Look at the closest match that you haven't even been able to group into a certain likely branch (or a common ancestor). Even if they don't have a tree, that's okay - look at your Shared Matches with them and open any match that has a viable tree. Compare the trees - do any of them share an ancestor with each other that you don't recognize? If so, research that ancestor and build a tree for them, you may find it links up with yours somehow, maybe even by breaking down a brick wall, or that it leads to an NPE - when someone's parent(s) is/are not their biological parent(s).

Additionally, you can look at your closest match that you haven't identified a common ancestor with, but you have grouped them into a likely branch. If they have a tree, again, build on it, and keep researching until you can find a connection. See my case example of Emma Elizabeth Sherwood.

This method of grouping your matches to single out the ones you can't identify at all can help lead you to some enlightening revelations, but they tend to be rather random. You don't know what you're going to find, you don't know which brick wall it might break down. Even the matches you can group into a likely branch but you're still searching for the common ancestor might surprise you - in my example of Emma Elizabeth Sherwood (above), I knew the match was related to my Mills branch (Emma's husband), but I had no idea it would finally break down the Sherwood brick wall that had been blocking me for 12 years.

Other methods.

There's other methods of breaking down a brick wall with DNA, ones that are more targeted for a specific brick wall, but they heavily rely on the surname you're looking for not being a very common one. You basically just search your matches trees for the surname you're looking for, and then compare the trees of the matches in the results, looking for a common ancestor among them. It can work well when the name isn't common, because it's likely most of the matches in the results will be the ones you're looking for. But the more common the name is, the more matches there will be in the results that aren't related to the branch you're looking for. That's why this never worked with Emma Elizabeth Sherwood (in my above example), because Sherwood was too common of a surname, I only found her family by using the more random grouping method and not knowing where an unknown match would lead me.

The surname search method would be much more effective if AncestryDNA would offer a very simple feature: the option to search for a surname within a specific location. At the moment, you can search for a surname or location, but not a surname in a location. So you can search for Smith OR Christian County, Kentucky, and you can search for them both at the same time, but it will include results for match's trees that have either the surname Smith, OR the location Christian County, Kentucky. And even if the tree includes both, it's not necessarily for the same branch or ancestor, it might be their Jones branch that's from Christian County, Kentucky, while their Smith branch is from Pennsylvania. For common surnames, we need a way to narrow it down, and the best way to do that is by looking for surnames within a specific location. At the moment, we can only do that manually by searching for a surname, and going through each match in the results to see for ourselves if that branch is from the right location. If so, then we can look for a specific common ancestor. It's very time consuming, and the more common the surname is, the less realistic it is to go through all those matches manually, yet there's a very simple way to make it easier, if AncestryDNA would just listen to their customers.

The surname search works a lot better if it's not a common surname. I successfully used this method with the surname Deaves, and also a suspected maiden name of Brannin.

You can also search by just location, but this only really works if your ancestors are from a very small, unique town, especially where there's endogamy. In my above example about my 2nd great grandparents who came from a tiny Italian town called Monteroduni, it's safe to say that the town is so small and endogamous that anyone who has ancestry from Monteroduni is probably related. Certainly, any DNA match of mine that has ancestry from Monteroduni, it's safe to say that's very probably how we are related. So I can very easily search my matches trees for the location of Monteroduni and even if I can't find a common ancestor between us, most likely that's probably where our common ancestors were from. Brick walls are difficult with endogamy though, so that might be the most I'll ever be able to determine. Searching by location may not break down any brick walls in your tree, but it does help you identify and sort your matches into groups/branches, which can help you find other unknown matches that may lead to a brick wall.

Like I say, sometimes breaking down a brick wall with DNA can be unique to the situation. Sometimes you have to think about what you're looking for, and consider the best way to come at the problem. But this should give you the basics to get you started. Feel free to share your success stories!

Tuesday, September 7, 2021

ThruLines is not the enemy

I see a lot of skepticism out there about ThruLines, and some of it is warranted, because it is based on family trees, which can have errors that get copied multiple times. But that doesn't mean you should dismiss ThruLines entirely, there are ways to get reliable use out of it, and not just by finding records that confirm them. There are ways to use DNA to find biological relatives or break down brick walls in your tree even when there's no written records of the lineage, and ThruLines is just one tool that can help you do this.

It's basically a matter of probabilities. The more people you match who are descended from multiple siblings of your ancestor, especially when all those descendants all or mostly match each other to form a cluster, the less likely it becomes that it's an error. When the matches mostly all match each other to form a cluster, you know they are all related and descended from the same branch/ancestor - you just need to identify which branch/ancestor, which is where trees and ThruLines come in. Each sibling that those matches descend from would have to be an error for trees/ThruLines to be wrong, so the more siblings you match descendants of, the more likely the trees are accurate. If you match 20 people (who mostly all match each other too) descended from 5 siblings of your ancestor, what are the chances there's been an error in the trees for each of those 5 siblings, plus your own ancestor? Extremely unlikely. In the example above (click to enlarge), there's 41 matches descended from 8 siblings of Elizabeth Mertz, so for this all to be wrong, there would have to be 9 different errors. This amount of evidence is really very conclusive, and I can probably confirm this family now.

Even assuming there's only one error and those siblings are indeed siblings to each other, but your ancestor is the lone error, and not actually their sibling, what are the chances you would match that many people from a certain family, if you weren't related to that family somehow? Using the example above again, what are the chances I match 41 people descended from those 8 siblings, if Elizabeth Mertz is not one of their siblings? Again, it's very unlikely - and the only way this would be possible is if there was a lot of endogamy involved, but even so, it would still be pointing you towards a specific population you're likely descended from (and matching surnames from the same endogamous population means you're probably related to that specific family somehow), so you don't want to dismiss it.

Granted, it doesn't confirm who exactly the parents of those siblings are, only that they are indeed siblings. For that, you'd have to go up another generation and do the same thing - look for people descended from siblings of the alleged father and mother. In the example above, it doesn't really confirm that Phillip Mertz is the father of Elizabeth and all her siblings, only that they are siblings from the same parent(s), whoever that may be. But for now, it's probably safe to add Phillip Mertz at least as a placeholder until more research can be done (it really is okay to add speculative data to your tree as long as you know it's speculative!).

In the example below, you can see how this ThruLines doesn't confirm descent from Benjamin Butler - the 6 DNA matches are descendants of children of David Butler, so this really doesn't confirm this potential ancestor at all.

And there's other limitations, mainly the fact that the Shared Matches tool (which is the only way to confirm if matches match each other and form a cluster) only includes estimated 4th cousins or closer (20+ cM). AncestryDNA really need to provide something more comprehensive. They say it's limited to 20+ cM because it would tax the server too much if they expanded it to include all matches. But at the very least, they could expand it to 15+ cM segments, which have a 100% chance of being identical by descent. That would still exclude most matches (8-15 cM) and therefore not be as taxing on the server, but include all matches that have a 100% chance of being IBD, which would make ThruLines so much more useful and reliable. At the moment, they are excluding hundreds, even thousands of IBD matches from the Shared Matches tool, which is extremely debilitating. Alternately, they could offer another tool that would be less taxing on the server - a simple one-to-one comparison. Pop in two match usernames, which would tell us whether those two matches match each other or not. Very simple, not very taxing, but it would get the job done.

Even so, it's still possible to get reliable usage out of ThruLines. Remember, ThruLines is only automating a process that people used to manually do (and still do when the relationship exceeds ThruLines' 5th great grandparent limit). If it weren't possible to use DNA to confirm relationships when there is no written record of it available, what use would DNA be, and how do you think all these NPEs are being discovered? While it's true that you do have to watch out for tree errors being replicated in ThruLines, if you understand how DNA and ThruLines work, there is useful data you can get out of it. To often, I see people who seem to completely dismiss ThruLines, as though it's not reliable at all, but you're only hindering your own research by thinking that.

Tuesday, August 10, 2021

Finding Unknown Biological Ancestors with DNA

This is a topic that comes up regularly in genealogy circles, because DNA testing often reveals cases of unknown adoptions, or what we call "non-paternity events" (NPE), when someone's father is not their biological father. Once there's enough suggestion that something like this has happened, the question then becomes, how do I identify this unknown biological ancestor? It can be done, although the further back on your tree it occurred, the more difficult it will be (far enough back and it might not be possible). Whenever possible, it's best to have someone from the oldest generation descended from this event to test. Like if you're looking for Grandma's unknown biological father, have Grandma take the test, or if she is unable or unwilling, have your relevant parent take the test. At the same time, if the person you're looking for is actually still living (like if you're adopted and looking for living biological parents), it will be difficult to research since lots of records on living people are private (that's a whole different ballgame and you often have to rely more on information and communication from your DNA matches). Additionally, if you're working with an endogamous population, you may be out of luck. With all that in mind, here's how it works.

Step 1: Look for your closest DNA match that you can't identify as being from another known branch of your tree. If they don't have a family tree added, that's okay because first you want to look at their Shared Matches, and open any matches that do have family trees (the bigger, the better).

Step 2: Compare the family trees of those Shared Matches, looking for ancestors any two or more (the more, the better) of them have in common with each other (especially if those matches also match each other) - ancestors who are not found in your tree. Yes, this may take some time because you have to manually compare the trees - I find it best to start with the surname list on the match review page and find surnames they have in common with each other, then see if those surnames actually lead to a common ancestor among them. If the ancestor is found in your tree, then you know this group isn't from the branch you're looking for and you can label them and move on.

Step 3: Build a descendant tree for the ancestor you found. Make a note of any descendants who were in the right place at the right time at the right age, but we're not done yet.

Step 4: Repeat this process with the next closest match you can't identify (who isn't a part of the first group).

Step 5: Look for a descendant who appears in both the trees you've built - so someone who descends from both the ancestors you've identified. This is probably either the person you're looking for, or a close ancestor of theirs, like a parent or grandparent. If you don't find one, keep repeating this process until you do.

Chart showing the two different DNA matches groups and their shared ancestors.
Click to enlarge.

For example (shown above - these names are made up but the situation is real and came from my tree): I was looking for my grandfather's unknown biological father, so I had my grandfather take the test before he died. I first found a group of his matches (who mostly all matched each other) who were all descended from a colonial ancestor named John Smith (I told you I changed the names, lol), so I built a descendant tree for John Smith. I then found another group of matches who all descended from another colonial ancestor called Christopher Jones, and built a descendant tree for him. By building those trees, I found a descendant of John Smith - named Isaac Smith - had married a descendant of Christopher Jones - her name was Carrie Jones. This suggested that the man I was looking for was probably a descendant of Isaac Smith and Carrie Jones, and based on the dates, it could only really be one of their sons, specifically one of their four oldest sons. Eventually, a close descendant of one of the four sons tested and confirmed which of the four sons was my grandfather's biological father (below).

Chart showing the closer matches that eventually showed up and allowed me to figure out
which of the 4 brothers was my grandfather's bio father.

Granted, there could have been another descendant of John Smith who married a different descendant of Christopher Jones, and that could have led me to the wrong family - this is why too much endogamy can throw you off. But as long as there's not too much of it, you can document each case of it and using your DNA matches and how much DNA you share with them, you should be able to figure out which descendants are the ones you're looking for. But a highly endogamous population might be too complex. If I was looking for an unknown bio ancestor on my mom's Mennonite branch, I'm not sure it would be possible. I can sometimes share up to about 5 ancestor couples with matches on my Mennonite branch. And the unknown father of my Italian ancestor who was from a tiny, highly endogamous town in Italy where everyone there is related to everyone else somehow? Forget it.

However, this is the same type of method that professional Genetic Genealogists like CeCe Moore employ to identify individuals from DNA left at crime scenes (either suspects or unidentified victims). It can be done (for the most part), it just takes work, and sometimes some patience for the right matches to come in.

Thursday, August 5, 2021

Understanding Admixture and Genetic Overlap at MyHeritageDNA

MyHeritage is generally known for not having the most reliable ethnicity results. This is probably because they were latecomers to the DNA field, and they haven't yet updated their percentage reports. But of course, all ethnicity percentages are merely an interpretation of our DNA, and not necessarily very reliable anyway. And what MyHeritage does do a great job of (unlike AncestryDNA), is showing us lots of data on all the genetic overlap among neighboring regions, so we can understand how it works. Not only do they show us all available regions and the areas they cover (below), but they have a section called "Ethnicity Maps" that shows us "the most common ethnicities in each country and the top countries for each ethnicity, according to MyHeritage DNA users' data." Although this is all based on data from MyHeritage, it's still a valuable learning tool for understanding admixture in general.

The percentages in the Ethnicity Maps show us the portion of testers in each country who get results in each ethnicity, or the portion of testers with results in each ethnicity within each country.

For example, looking at the data by country, if you click on Germany, you'll see 55.7% of people living in Germany get results in the "North and West European" ethnicity (above). We don't know what average percentage they get for "North and West European" because the data doesn't include that, but it's probably pretty high. It then goes on to list another 19 ethnicities down to 1.2%, from all over Europe, Asia, Africa, and even Native America. That is not all due to genetic overlap, but simply because there may be, for example, a few Asians living in Germany who took the test. For genetic overlap, it's probably best to look at the top 5 ethnicities - which is likely why their default view is the top 5. What that shows us is that lots of people in Germany also get results in East European (48.9% of testers), Scandinavian (43.6%), Balkan (38.1%), and English (23.3%), illustrating the strong genetic overlap Germany has with those nearby areas. That means if you have known German ancestry, it would not be uncommon to get results in any of those neighboring regions, especially (though not exclusively) from MyHeritage's results.

On the flip side, when you look at the Ethnicity Maps by ethnicity, it shows us the most common countries each ethnicity is found in. This gives us a good understanding of two things: the top 5-10 countries show us the areas covered by that category (although our own Ethnicity Estimate already gives us that, this can give us an understanding of just how broad that area could really be), and the full list of countries shows us how much emigration there's been from each country around the rest of the world. For example, 36.7% of people in the USA get results in North and West European, which is not surprising, considering how many German immigrants there have been to the USA over history. This doesn't really show us genetic overlap, but it is very useful for understanding modern migration patterns.

Hopefully, as MyHeritage update their ethnicity reports, they will also update this very useful data and not retire it like AncestryDNA keep doing.

Sunday, August 1, 2021

Understanding Admixture and Genetic Overlap at AncestryDNA

I talk a lot about the genetic overlap that exists among neighboring regions and how that influences ethnicity percentages, or admixture. Unfortunately, AncestryDNA keeps taking away valuable learning tools for understanding these relationships between various populations, making it harder to illustrate them. First, they removed the Average Admixture Chart, then they removed the Genetic Details page, and now they've even removed our ability to click on "see other regions tested" and explore the maps and details of any region to understand the overlap they have with neighboring regions. The only thing left is the PCA chart in the Ethnicity White Paper, but even that has always been limited to Europe. 

The Average Admixture Chart (below) used to show us what the results of a typical native of every region would expect to get. It showed how much or how little each population was admixed. So for example, if you were of 100% British descent, you could expect to actually only get about 60-70% in Great Britain (this was before they decided to attempt to split up Britain), and around 8-10% in Europe West, Ireland, and/or Scandinavia. This illustrated the common overlapping DNA among the British, Germanic people, and Scandinavians, and also the close relationship between the British and Irish (sorry, Ireland). Europe West was even more admixed, averaging less than 50% results in Europe West, and the rest coming from pretty much everywhere else in Europe except Finland/NW Russia. Scandinavia was less admixed, averaging between 80% to 90% in Scandinavia, and only small amounts from Europe West, Great Britain, Finland/NW Europe, Ireland, and a smidge from Europe East. The chart made it clear just how admixed Europeans themselves are, or can be, and to AncestryDNA, that is apparently a bad thing that they are now trying to hide, because it means ethnicity percentages, by nature, are always very reliable, and can't always be broken down into more specific regions. That's something customers are frustrated by, so one by one, they keep taking away the learning tools that would help customers understand this.

The loss of the Average Admixture chart wasn't too unfortunate, because the same/similar data could essentially be found on the Genetic Details page. Previously, when you clicked on a region, and then clicked "More info", there would be a page with two tabs - one which still remains with the detailed history of the population and their migrations, and the other had genetic details that helped us understand the genetic overlap that region had with nearby regions. That second page is now gone. It showed us two very important charts that basically replaced the data in the Average Admixture chart. The first one (below) showed us the average percentage that a native of that area would likely get for that region (same as you would find on the Average Admixture chart). 

The second chart (above) showed us "Other regions commonly seen in people native to [this region]". This wasn't exactly the same data from the Average Admixture chart - it rather detailed the amount of people native to that region who got any amount of results in which neighboring regions. So it didn't tell us the amount a native would expect to get in those other areas, but how common it was for a native to get results in those other areas. Not exactly the same data, but still valuable data for understanding common overlap.

With these two vitally important learning tools gone, I often turned to the simple map and details of each region to illustrate how each region often covers neighboring regions as well. If you click on "Read full history" for each region, you can find not only the areas "primarily found" in that region, but the areas "also found in" that region too (above). Unfortunately, AncestryDNA has neglected to add the "Read full history" link to some of the newer regions (like Scotland) they added recently! An oversight? Or an indication they may also be retiring this page altogether now too? And on top of that, a new revamp of the appearance of our ethnicity results (may not be available to everyone yet) seems purely aesthetic at first, until you notice the link to "See other regions tested" is now gone too (below). 

It's as though they don't want people to understand how much genetic overlap there is between certain regions, even though it would greatly help people to understand their results. And now, anytime people ask, "If I get results in X, is it coming from my Y ancestry?" and it's not a region I have results in, I can't answer them because I can't look up the map and details of regions I didn't personally get results in. This kind of question gets asked so frequently in social media, and frankly, people like me basically wind up fielding these questions for Ancestry's customer support, and they keep making it more and more difficult. I guess if they really want a huge increase in the load on their customer support, that's fine, but if that's the case, they really shouldn't have gotten rid of their support email (you can now only contact them by phone, or social media like Facebook). So, they're making it harder for customers to understand their results, and harder for customers to contact them about it. Epic fail on customer service, AncestryDNA.

The only remaining tool is the PCA chart (top), which is limited to Europe and therefore not much help in understanding results outside of Europe, or any relationships that might exist in the crossroads between Europe and other continents. And frankly, I have some concerns that voicing this will lead to them to remove the PCA chart too.

The percentage range included in our results is also useful for understanding that the percentages are very much an estimate, but not very useful for understanding the genetic overlap between regions. Still, hopefully they don't retire this feature either, but the ongoing trend doesn't bode well for it. 

Wednesday, May 26, 2021

Add Specific Relationship, AncestryDNA's Latest Feature

It sounds like it hasn't been rolled out to everyone yet, but it should be coming soon - AncestryDNA is (finally) adding the ability to change the estimated relationship range with a DNA match to a specific, known relationship instead. They're a few years behind 23andMe and FTDNA (although 23andMe still don't have shareable family trees so 23andMe is no better overall), but better late than never.

In the process of adding the specific relationship, it asks you which side of your tree the match is from, your mother's side, father's side, or both. And for matches you're unsure of the specific relationship, but you know which side of your tree they come from, there's an option to select which side and then instead of choosing a specific relationship, you can click "I'm not sure". It will then display "Mother's Side" or "Father's Side" (or both) without a exact relationship (the original estimated range will remain). 

Unfortunately, it does have some limitations. The main one is that it only goes out to 5th cousins, and any more distant relationships only have an option for lumping them all into a general "Distant Relationship" label. Not only does this rather defeat the purpose of being able to add a specific relationship if it's not actually a specific relationship, but it's also inconsistent with ThruLines, which at least goes out to 6th cousins (though that too is arguably a little limited). So essentially, ThruLines is going to show us our exact relationship with many 5th cousins once removed and 6th cousins, yet the new feature offers no way to add those specific relationships. The least they could do is expand it to the 6th cousins so it's consistent with ThruLines.

The other limitation is that it doesn't let you select more than one relationship, which is a complete oversight when it comes to lots of people who have endogamous branches of their tree, and identifiable endogamy (more than one set of most recent common ancestors) with many matches. Even when you select "Both Sides", it doesn't give you the option for more than one relationship. If it's a close match, assumes you've selected both sides because the person is someone like a niece or nephew, or full sibling, etc. Someone who shares your whole ancestry. If they aren't a close match, it seems to assume that although you may have two different relationships, they must be more distant than 5th cousins and only gives you the option to select "Distant Relationship". I suppose they're trying not to over complicate it for newcomers, but for people who use this for heavy research and breaking down brick walls in their tree, noting multiple relationships is vital.

It should also be noted that if one or more of your parents have tested, the system will automatically assign a match to your mother's side or father side depending on who they match. If for some reason, the system got it wrong, or only selected one when they actually match both, you can edit this by simply clicked the back button in the upper left corner of the side window (highlighted in yellow in the screenshot below).

That pretty much sums it up. In general, it's great they finally added this option, I know lots of people have been asking for it for a while. And I have gone through and selected known relationships for all the matches I've identified. But you may notice I have, for a very long time now, always noted the relationship and shared ancestor(s) in the notes field (along with emojis I used before groups were available). Unfortunately, due to the limitations of the new feature, I will have to continue noting the relationship myself in the notes field instead of relying solely on Ancestry's tool.

Wednesday, January 20, 2021

Augusta's Confederate Monument and My Family History

Sometimes, genealogy and modern politics have some overlap. 

In Augusta, Georgia, a giant, imposing obelisk stands as a memorial to Confederate soldiers. In particular, it hosts four life-sized statues of infamous Confederate military leaders, Robert E. Lee, Stonewall Jackson, William H.T. Walker, and Thomas R.R. Cobb (shown left), the latter of which is my 2nd cousin 6 times removed. It's a distant relation, for sure, but a significant one, given why this monument was erected in 1878 by the Ladies Memorial Association of Augusta.

Let's be clear about one thing - statues, monuments, and memorials are built in honor, respect, and celebration of the people, places, or events they represent. Monuments like this one are made in memory and respect of men whose only or primary contribution to history was to lead armies in the name of slavery. And when monuments like this are inscribed with quotes like the following, that fact becomes impossible to deny:

"No nation rose so white and fair 

None fell so pure of crime."

I'm not sure how anyone could say this is anything but blatantly racist, and this is the inscription found emblazoned under the feet of my cousin. That makes this a little bit personal for me - southerners like to shout about how this is their history, and their heritage. But for me, it literally is. 

And I say, tear it down.

The removal of statues like this isn't the removal of history. History belongs in museums and history books, which there is no lack of. There are plenty of Civil War and Confederate artifacts and memorials to be found in museums, where people can study and learn from them, and this monument could be one of them instead of being revered in the middle of town. When a memorial is erected in honor and celebration of Confederate leaders, it's honoring and celebrating slavery and racism. By not only leaving such a monument standing, but actively refusing to remove it, the government is not-so-subtly condoning a symbol of racism. We would never tolerate a statue of Hitler standing proudly in the middle of a city, paying homage and reverence to him, so why do we allow it for Confederate leaders?

Even if we remove all the slavery and racism from such symbolism (which isn't really possible, but just for the sake of argument), you're still left with a monument honoring men who tried to tear our nation apart with war and violence. Are we now going to see statues of the leading faces among the rioters who broke into the Capitol Building? Because they too are talking about civil war, they too are waving the Confederate flag, and they too believe their violence to be "fair" and "pure of crime". In reality, they are traitors, and so were Confederate leaders. And actually, many (if not all) of the rioters are also proud racists, just like Confederate leaders, as proven by the presence of white supremacy groups and Neo Nazi symbolism during the Capitol riot. Confederate leaders were also unabashed racists. This direct quote from my own cousin, Thomas R.R. Cobb, a Confederate General, proves it:

"This inquiry into the physical, mental, and moral development of the negro [sic] race seems to point them clearly, as peculiarly fitted for a laborious class. The physical frame is capable of great and long-continued exertion. Their mental capacity renders them incapable of successful self-development, and yet adapts them for the direction of the wiser race. Their moral character renders them happy, peaceful, contented and cheerful in a status that would break the spirit and destroy the energies of the Caucasian or the native American."

None of this means that every statue of every historical figure who ever did anything unethical should be removed. Yes, I am aware of the fact that most of our founding fathers were slaveholders, and that's certainly something I don't think we should hide from - we should talk about it, and we should teach it in schools. Our founding fathers were flawed, and that's something we have to accept. But the difference is, statues of founding fathers were not constructed in honor of them being slaveholders, they were erected in honor of all their positive achievements. Statues of Confederate leaders are produced in honor of their fight for slavery, and supporting that in the name of "history" or "heritage" is, frankly, bullshit. It's not in the name of history, it's in the name of racism and hatred. 

And if your heritage is racist, it's not something you should celebrate. Read about it in a history book, or visit a museum, or find something else about your heritage to celebrate that's not based on slavery, but do not protect a symbol of hate standing proudly in the middle of a city. 

Shame on the Augusta, Georgia government, and shame on everyone who enables them.

Friday, January 8, 2021

Small but Significant Changes at

Ancestry is rolling out some new tweaks to their website that has everyone in a tizzy and I don't really know why, because in many ways they seem like an improvement to me. 

One of the changes was the removal of the clickable alphabet at the top of our List of All People. It allowed us to jump to surnames that start with any letter by clicking on the letter. I know the removal of this seems like a negative, but it's really not. What remains are two name search fields, one for first name, the other for last name. They were always there, and they always offered the ability to do what the alphabet list offered too, which I imagine is why the alphabet was removed - there is no point in having two different tools that do the exact same thing. You can still jump to surnames starting with any letter by simply putting that letter into the surname search field. But even better than that, the search fields offers way more versatility than the alphabet did, because you can also do the same for the first name field (shown above), and you can use more than one letter, so you can quickly bring up all "Mc" or "Mac" surnames, for example (shown below). I believe this was always an option, a lot of people just apparently didn't realize it.

The bigger change is in hints, clicking on a hint now brings up a side bar to preview the hint (shown below) instead of it loading a whole new page. People are complaining that it requires more clicks to confirm and attach the hint now, but that's just not true. In the past, you had to click on the hint, it loaded a new page where you could review the record, then you clicked "yes" to the hint being correct and it loaded the page that allowed you to edit the data you're adding, then you click "save to your tree" and you were done. That's 3 clicks.

It's the same now. You click on the hint, but instead of it loading a new page, the side bar pops up. Instantly, I felt this was an improvement because you haven't left the person's profile page, so you can still fully view and compare all their data, and sources, etc. You can click on Facts, Gallery, etc and the side bar remains up, allowing for a full comparison (shown below). In the past, the only way to do this was to right click the hint and open it in a new tab, which you can still do, but now there's no need. I always did this, because I generally want to refer back to the profile while checking a hint. Now, finally, I don't have to open a new tab, which is going to make my workflow much more efficient. 

Even if you didn't open the hint in a new tab in the past like me, the number of clicks is still the same. After clicking on the hint and the side bar popping up, you click "yes" to the hint being correct and it loads the page where you can edit the data you're adding, and then you click "save to your tree" just like before. That's 3 clicks.

So I'm really not sure what the fuss is all about. The changes either won't slow your workflow, or they will actually improve it. Give it a chance, you might find it works better.