Saturday, July 19, 2025

Saturday Night Genealogy Fun: Do Some FamilySearch Full-Text Searching

Randy Seaver has more AI in store for us for this week's Saturday Night Genealogy Fun.

Your mission, should you decide to accept it (cue the Mission:  Impossible! music) is:

1.  FamilySearch Full-Text Search continues to add databases and searchable images to their collections.  This is a gold mine, especially of land, probate, and court records.

2.  Pick one or two of your ancestors and see what you can find on FamilySearch Full-Text Search about them.

3.  Share your Full-Text Search find(s) in your own blog post or in a Facebook, SubStack, BlueSky, or other social media post.  Leave a link to your post on this blog post to help us find your post.

The results for most of my searches for my ancestors (and remember, an ancestor is someone from whom you descend, not a collateral relative; since there is no such thing as an "indirect ancestor", the term "direct ancestor" is redundant at best and nonsense otherwise) either had far too many results to look through, when I searched for just a surname, or no results at all, when I searched for full names in quotation marks to control the number of irrelevant results.  My names are not extremely common ones, such as Smith and Jones, but they are common enough that a blind search with no index produces far too many results to slog through.  My grandmother's name was Anna Gauntt, with no middle name that I have ever discovered, so I searched for "anna gauntt" and learned that the AI provides results with something between your search terms.  I was able to rule out all of those Anna Gauntts, because they either had middle names or initials or were not in the correct locations for my grandmother.  I abandoned my ancestors and searched for some of the unique surnames that I am doing research on.

My search for Gudapel, a name which has been used by only four people in the history of the world, produced two results.  Both had headers that read "History Records 1800–1902, Diaries 1800–1902 | New Hampshire. Genealogies 1978–1982, Society Records 1978–1982 | Maine. Genealogies 1978–1982, Society Records 1978–1982" (truncated on screen, but visible when I moused over the link).  If I were searching for a name that I did not know as well, I might have ignored these results, because I know the family was never in New Hampshire and Maine and would have no reason to appear in genealogies for those area.  Because I do know the name, I clicked on the links, hoping that maybe there was something from the 19th century, and discovered that the title was not particularly accurate.  Both links went to the same book, a 1941 Houston, Texas city directory, which did include the name of Geo. [George] Gudapel on two different pages.  While city directories can be classified as history records, the directory in question does not fall in the years of 1800–1902.  I'm not sure if I already had these directory listings.

I next searched for McStroul, a name which my aunt's grandfather created when he naturalized as an American citizen.  His original name was Moska Leib Strul.  He asked to have it changed to Leo Martin McStroul when he became a citizen.  The name McStroul belongs only to his family; when I find it, I know it has to be the right people.  When I entered it in the keyword field, I had 42 results.

I searched for McStroul when Randy did a previous FamilySearch Full-Text Search challenge.  At that time I had two results, both in my aunt's great-grandmother's will and probate.  The name appeared because my aunt's mother, who was the granddaughter of the deceased, was listed under her married name.  I noted at the time that the name actually appeared three times in the document, but only two were identified by the search.  This time the search picked up all three occurrences.

All 42 results for McStroul that Full-Text Search found were correctly read.  A couple of the given names were misread by the AI (such as Geo instead of what actually said Leo).  Many of the documents were ones that I have not previously found, such as articles in a Kingman, Arizona newspaper and naturalizations in Milwaukee, Wisconsin where the McStroul family members appeared as witnesses.  So these are all useful in researching the family and are nice discoveries.

I tried searching for Szocherman, a name in my family that I believe more and more may be unique to that branch of my cousins.  I had no results.  But when I searched for Socherman, a spelling which some family members have adopted, I found many results, almost all of which were not for my cousins.  Amusingly, one that was my family was misread by AI and actually does say Szocherman.

After all of that, I tried to find a collateral relative in the database and searched for "frederick dunstan" in quotation marks.  Again the AI provided results with something between my search terms, and there were far more Frederick Dunstans than I had anticipated, more than was practical to look at.  I restricted the search to New Jersey and ended up with only four results, three of which were for Frederick C. Dunstan in Burlington County, which is exactly where my great-grandmother's brother should be found.  I believe this is the right person, so it appears he had a middle name, which I previously did not know, that started with C.  All three results are from deed indices from the early 1920's.  That does provide me with information about him.

Before I restricted the search to New Jersey, some of the results were for a Frederick Dunstan in Allegheny County, Pennsylvania.  I looked at them bcause that isn't horribly far from New Jersey and found that they were for a Frederick Dunstan from Combe Martin, England, which is in Devon County.  My Dunstans were from Lancashire, so I knew this was not my guy, but ironically, his wife's name was Jane.  Jane Dunstan is my great-grandmother and the sister of Frederick Dunstan.  I did find that entertaining.

10 comments:

  1. I've had the opposite results with the AI, it doesn't show names that vary, except for tagging first and last names but not the middle initial. I'm doing a presentation for the Sac Germ group this Tuesday and show how I had to use wildcards to locate an odd German name.

    ReplyDelete
    Replies
    1. That's an interesting difference. I wonder how its algorithms are set up that it behaves so differently.

      I saw that you are scheduled for that presentation. I hope it goes well.

      Delete
  2. You are absolutely right to comment on the title of the collections. Sometimes, they are spot on and, at other times, the "collection" that appears has 10 different items in it and the title doesn't refer to say, item 8, which is the one that is relevant for us. It's still better than nothing, though!

    ReplyDelete
    Replies
    1. It was definitely better than nothing! I did find something useful for my own family, after all. I think the biggest problem with inaccurate labels, though, is that people will be encouraged to look through everything, all the time, just in case the title is inaccurate. That is not a productive way to do research.

      Delete
    2. I too had noticed the issue with collection names...a frustrating side of this new feature at FS, but nothing is ever perfect. And I agree with Janice - we can waste a lot of time if we go through every result, so have to learn to take cues from the brief summary we can see in the results to determine if it's a record worth looking at.

      Delete
    3. Thanks for your support, Tess! It's always good to know you are not the only person with a particular viewpoint.

      Delete
  3. Congrats on making some useful finds! And I agree that it's time-consuming to have to slog through all results because collections aren't always accurately named, but we did that anyway in the ole days before online search, right?!

    ReplyDelete
    Replies
    1. We did slog through stuff before online searches, true, but my recollection is that the stuff was generally more accurately identified. Not always, of course, and if your relative happened to be somewhere you never would think of looking for him, then you wouldn't find him. We also used indices more extensively, which helps direct research more effectively than blind searches without context. But if I can't trust an archive to name its collections accurately, I start to question more about it overall.

      Delete
  4. It's great to see people finding new-to-them documents with full-text search. Hopefully over time the kinks will get worked out to make it less of a slog.

    ReplyDelete
    Replies
    1. I'm still wondering why the collections are so badly misidentified. That should be a relatively straightforward thing to fix.

      Delete

All comments on this blog will be previewed by the author to prevent spammers and unkind visitors to the site. The blog is open to everyone, particularly those interested in family history and genealogy.