* Name variants
- gwilym'smum
- Superstar
- Posts: 302
- Joined: 01 Feb 2016 16:28
- Family Historian: V6.2
- Location: South Cheshire
- Contact:
Name variants
Hello, I have a family some of whom use Pickering and some use Pickerill. I enter them as they are recorded in the records (I do know they are the same family as sometimes the individuals use both forms). When I wish to find individuals from the records window is it possible to get all the individuals from both forms to appear together please. Thank you Ann
Researching Mayer, Parr/Parr, Simcock, Beech and all related families
- johnmorrisoniom
- Megastar
- Posts: 882
- Joined: 18 Dec 2008 07:40
- Family Historian: V7
- Location: Isle of Man
Re: Name variants
Hi Ann,
In the filter box at the top of the records window, just type in Pickeri and the record list will show only those records that match (including both variants)
In the filter box at the top of the records window, just type in Pickeri and the record list will show only those records that match (including both variants)
- gwilym'smum
- Superstar
- Posts: 302
- Joined: 01 Feb 2016 16:28
- Family Historian: V6.2
- Location: South Cheshire
- Contact:
Re: Name variants
Why didn't I think of that! don't think outside the box. Thank you very much. 
Researching Mayer, Parr/Parr, Simcock, Beech and all related families
Re: Name variants
Name variants are a huge topic and cause all kinds of problems. There have been many postings about this (including some from me) and there are sure to be many more.
As a rider to John's reply, the FH filter box operates as an initial match so typing in "Pickeri" finds Ann's variants just fine.
In other cases it may be less than 100% effective. Take a family that has variants Dennes, Denness, Dennis. An initial match with "Denn" is OK, but will also find things like Dennison, Dennet[t] and so on.
Using clever stuff like "regular expressions" (too hard for me) may be the ideal solution, or even just wild-cards ("Denn*s" work for me on Ancestry, Family Search, etc.).
But there is another issue that becomes important if this is one of your "big names" in the family tree... the Records Window list has to sort on something and usually this is Surname (followed by Given name(s), etc.). Ann's list will separate Pickerill and Pickering as two separate blocks perhaps pages apart.
As Ann says some of them use both forms, we can't know in advance which form is the surname entered in FH. So it would be better to sort on a kind of generic (or standardised or normalised) surname if possible. That way, Anne Pickering could find its way between Ann Pickering and Anne Pickerill in the list.
Different people come up with different solutions. For the "Dennis" problem mine is to replace all such surnames with a fictional "DNS", and all lists keep everyone in one block. This works fine even if their surnames change over time or in different sources.
As for knowing what they were called at any particular point in time, well our source records spell it out. And just to summarise the range of surname usage for any one person I have an extra Surname added to the main Name field, containing some text like "Dennis, Denness" which can be shown on diagrams or in reports.
I use a similar approach for given names. Too often "Ann" and "Anne" are used indiscriminately, but "Ann" for all keeps them all together in the lists and it is very clear that "Anne Dennis (b.1835)" may be a duplicate of "Ann Dennes (b.c.1834)".
What's a name, anyway?
As a rider to John's reply, the FH filter box operates as an initial match so typing in "Pickeri" finds Ann's variants just fine.
In other cases it may be less than 100% effective. Take a family that has variants Dennes, Denness, Dennis. An initial match with "Denn" is OK, but will also find things like Dennison, Dennet[t] and so on.
Using clever stuff like "regular expressions" (too hard for me) may be the ideal solution, or even just wild-cards ("Denn*s" work for me on Ancestry, Family Search, etc.).
But there is another issue that becomes important if this is one of your "big names" in the family tree... the Records Window list has to sort on something and usually this is Surname (followed by Given name(s), etc.). Ann's list will separate Pickerill and Pickering as two separate blocks perhaps pages apart.
As Ann says some of them use both forms, we can't know in advance which form is the surname entered in FH. So it would be better to sort on a kind of generic (or standardised or normalised) surname if possible. That way, Anne Pickering could find its way between Ann Pickering and Anne Pickerill in the list.
Different people come up with different solutions. For the "Dennis" problem mine is to replace all such surnames with a fictional "DNS", and all lists keep everyone in one block. This works fine even if their surnames change over time or in different sources.
As for knowing what they were called at any particular point in time, well our source records spell it out. And just to summarise the range of surname usage for any one person I have an extra Surname added to the main Name field, containing some text like "Dennis, Denness" which can be shown on diagrams or in reports.
I use a similar approach for given names. Too often "Ann" and "Anne" are used indiscriminately, but "Ann" for all keeps them all together in the lists and it is very clear that "Anne Dennis (b.1835)" may be a duplicate of "Ann Dennes (b.c.1834)".
What's a name, anyway?
Paul White
"Family Historian is not just for Christmas, but for Life"
"Family Historian is not just for Christmas, but for Life"
- LornaCraig
- Megastar
- Posts: 2989
- Joined: 11 Jan 2005 17:36
- Family Historian: V7
- Location: Oxfordshire, UK
Re: Name variants
Rather than adding an extra surname to the main name field you can use the Alternate Name field(s) to add all the variations you find. These can be added to your diagram text scheme or included in reports.dbnut wrote: And just to summarise the range of surname usage for any one person I have an extra Surname added to the main Name field, containing some text like "Dennis, Denness" which can be shown on diagrams or in reports.
Lorna
- tatewise
- Megastar
- Posts: 27078
- Joined: 25 May 2010 11:00
- Family Historian: V7
- Location: Torbay, Devon, UK
- Contact:
Re: Name variants
Another possibility to consider is the SOUNDEX qualifier, which will tend to group like sounding Surnames together.
Mike Tate ~ researching the Tate and Scott family history ~ tatewise ancestry
Re: Name variants
Thanks, Lorna, that's definitely an option.LornaCraig wrote:you can use the Alternate Name field(s)
In my case, though, I already use extra Name instances to document "significant" changes that dramatically impact dictionary order - mostly deliberate adoption of different forms like "Alice Brown Smith" to "Alice Brown-Smith" or other stark changes, including sometimes recording married names (don't usually bother). Agreed these can be displayed/printed.
A slight objection here is that it creates (or permits) a full name structure.
Now you get me thinking, something I ought to consider is adding one new Surname instance to the (normalised) Main Name for each recorded variant. But to save space I'd prefer to keep the whole variant surname list when displayed/printed to a single line and I simply have no idea how to do that.
Perhaps some clever so-and-so can show me how to do that (with scripting or really smart use of functions???).
Paul White
"Family Historian is not just for Christmas, but for Life"
"Family Historian is not just for Christmas, but for Life"
Re: Name variants
Hey, Mike, more food for thought...tatewise wrote:the SOUNDEX qualifier
Searching online sources with that technique has given me very mixed success. And anyway there seems to be no way to incorporate SOUNDEX in query Row criteria or functions?
Paul White
"Family Historian is not just for Christmas, but for Life"
"Family Historian is not just for Christmas, but for Life"
- tatewise
- Megastar
- Posts: 27078
- Joined: 25 May 2010 11:00
- Family Historian: V7
- Location: Torbay, Devon, UK
- Contact:
Re: Name variants
%INDI.NAME:SOUNDEX% can be used wherever any other Data Ref can be used in Columns tab, Rows tab, functions, etc.
Mike Tate ~ researching the Tate and Scott family history ~ tatewise ancestry
Re: Name variants
Hehe, thanks, I learn something new every time!tatewise wrote:%INDI.NAME:SOUNDEX%
Paul White
"Family Historian is not just for Christmas, but for Life"
"Family Historian is not just for Christmas, but for Life"
- tatewise
- Megastar
- Posts: 27078
- Joined: 25 May 2010 11:00
- Family Historian: V7
- Location: Torbay, Devon, UK
- Contact:
Re: Name variants
To display multiple variant Surnames on one line would use something like:
%INDI.NAME[1]% %INDI.NAME[2]:SURNAME% %INDI.NAME[3]:SURNAME% and so on...
If any variant does not exist then nothing is displayed.
The only small caveat is that the intervening spaces are all displayed.
%INDI.NAME[1]% %INDI.NAME[2]:SURNAME% %INDI.NAME[3]:SURNAME% and so on...
If any variant does not exist then nothing is displayed.
The only small caveat is that the intervening spaces are all displayed.
Mike Tate ~ researching the Tate and Scott family history ~ tatewise ancestry
Re: Name variants
Huh, never thought to try that out! Reckon 5 should be enough for me.tatewise wrote:%INDI.NAME[1]% %INDI.NAME[2]:SURNAME% %INDI.NAME[3]:SURNAME% and so on
Huh, should have expected that from other experiments!tatewise wrote:If any variant does not exist then nothing is displayed.
Wouldn't be much of a problem as redundant ones would be limited in number and come at the end. Nice!tatewise wrote:The only small caveat is that the intervening spaces are all displayed.
Better still if we had a Trim function???
Paul White
"Family Historian is not just for Christmas, but for Life"
"Family Historian is not just for Christmas, but for Life"
- tatewise
- Megastar
- Posts: 27078
- Joined: 25 May 2010 11:00
- Family Historian: V7
- Location: Torbay, Devon, UK
- Contact:
Re: Name variants
There is in effect a trim function, that is even more flexible:
=CombineText(" ",%INDI.NAME[2]:SURNAME%,,)
That combines the first 3 parameters only if the the 2nd parameter produces text, otherwise it uses the 4th parameter.
So in the above case it only produces a space before the Surname if the Surname exists.
Therefore, to produce a trimmed list of full-name followed by alternative surnames try using:
=Text( %INDI.NAME% . CombineText(" ",%INDI.NAME[2]:SURNAME%,,) . CombineText(" ",%INDI.NAME[3]:SURNAME%,,) . CombineText(" ",%INDI.NAME[4]:SURNAME%,,) )
If you prefer, you can replace each " " with ", " so the Surnames are comma separated, but only where they exist.
=CombineText(" ",%INDI.NAME[2]:SURNAME%,,)
That combines the first 3 parameters only if the the 2nd parameter produces text, otherwise it uses the 4th parameter.
So in the above case it only produces a space before the Surname if the Surname exists.
Therefore, to produce a trimmed list of full-name followed by alternative surnames try using:
=Text( %INDI.NAME% . CombineText(" ",%INDI.NAME[2]:SURNAME%,,) . CombineText(" ",%INDI.NAME[3]:SURNAME%,,) . CombineText(" ",%INDI.NAME[4]:SURNAME%,,) )
If you prefer, you can replace each " " with ", " so the Surnames are comma separated, but only where they exist.
Mike Tate ~ researching the Tate and Scott family history ~ tatewise ancestry
- AdrianBruce
- Megastar
- Posts: 1961
- Joined: 09 Aug 2003 21:02
- Family Historian: V7
- Location: South Cheshire
- Contact:
Re: Name variants
For anyone not familiar with the Soundex system (see https://en.wikipedia.org/wiki/Soundex), it comes with a considerable number of caveats. The most important point is that it (sort of) only encodes the first letter and the next 3 consonants. Thus anyone with long names will probably find the latter part of those names being ignored - e.g. my Pickstocks get equated in Soundex with all sorts of Eastern European names. If your scope is your own database for Soundex analysis, then you probably won't have that problem but others may arise - Pickstock keeps getting equated to Pickstone - which is a correct equivalence in Soundex but I have seen zero evidence of any equivalence in real life!tatewise wrote:Another possibility to consider is the SOUNDEX qualifier ...
By all means try it - you may find it useful. Or you may want to stick it where....
Adrian
- tatewise
- Megastar
- Posts: 27078
- Joined: 25 May 2010 11:00
- Family Historian: V7
- Location: Torbay, Devon, UK
- Contact:
Re: Name variants
Adrian, your description of the Soundex encoding is an oversimplification, because some consonants are ignored or combined, so often more than the next 3 after the first are encoded. See rules 1 and 3.
Also Pickstock Soundex is P232 whereas Pickstone Soundex is P235 so they are similar but not equivalent.
The Pickering Soundex is P265 and the Pickerill Soundex is P264.
Interestingly, variants such as Peckerring and Peckerell or Packarrill have similar Soundex codes, but cannot use the FH Filter to bring them together.
Also Pickstock Soundex is P232 whereas Pickstone Soundex is P235 so they are similar but not equivalent.
The Pickering Soundex is P265 and the Pickerill Soundex is P264.
Interestingly, variants such as Peckerring and Peckerell or Packarrill have similar Soundex codes, but cannot use the FH Filter to bring them together.
Mike Tate ~ researching the Tate and Scott family history ~ tatewise ancestry
- AdrianBruce
- Megastar
- Posts: 1961
- Joined: 09 Aug 2003 21:02
- Family Historian: V7
- Location: South Cheshire
- Contact:
Re: Name variants
An oversimplification indeed - hence my inserting "sort of" in italics. Interesting that you calculated Pickstock and Pickstone slightly differently - those two are the bane of my life (or, at least, my research) in being shown as equivalents on various sites with zero justification in real life (unless anyone knows different!). It is often hard to discern the logic used to identify equivalents on various sites. I have seen lots of people reject such logic in favour of wild cards - regrettably, "my" names seem not to suit wild cards and phonetic matching in whatever form seems somewhat better, even if Soundex itself is poor for the names that I have an interest in.
Adrian
- tatewise
- Megastar
- Posts: 27078
- Joined: 25 May 2010 11:00
- Family Historian: V7
- Location: Torbay, Devon, UK
- Contact:
Re: Name variants
The Pickstock and Pickstone Soundex calculations are not mine but FH using %INDI.NAME:SOUNDEX% as mentioned earlier, which appears to use the Wikipedia rules.
i.e.
Using the Wikipedia rules for Pickstock:
i.e.
Using the Wikipedia rules for Pickstock:
- Retain first letter P and remaining consonants are ckstck
- Replacement digits are 222322
- Remove duplicate digits 232
- Thus the Soundex is P232
- Retain first letter P and remaining consonants are ckstn
- Replacement digits are 22235
- Remove duplicate digits 235
- Thus the Soundex is P235
Mike Tate ~ researching the Tate and Scott family history ~ tatewise ancestry
- AdrianBruce
- Megastar
- Posts: 1961
- Joined: 09 Aug 2003 21:02
- Family Historian: V7
- Location: South Cheshire
- Contact:
Re: Name variants
I would agree Mike - no proper Soundex calculation should come up with Pickstock and Pickstone being identical. I have to presume, therefore, that the site(s) that do equate the two are not actually using Soundex per se but some other algorithm. Next time that I see the equivalence of the two, I shall examine what (if anything) they call their matching process.
I think that my overall position is still that Soundex can produce odd equalities because of the truncation, but anyone's FH file might well not include any of the oddities.
I think that my overall position is still that Soundex can produce odd equalities because of the truncation, but anyone's FH file might well not include any of the oddities.
Adrian