If you deal with last names for any largish set of people, you ought to consider the following cases.
Family Names
These are some kinds of last names programmers I know have mangled or rejected:- MacDonald (included capital)
- O'Brien (included punctuation)
- Armstrong Zwicky (internal space, too long)
- d'Aulaire (no initial cap, included punctuation)
- van den Berg (two missing capitals, internal spaces)
- Smith y Ibarra (missing capital internal, internal spaces)
- O (one character)
- Satyanarayanan (too long)
- St. Pierre (different included punctuation, internal spaces)
- ffollett (no capitals at all)
First and Middle Names
First names can also have spaces in them. Some people with the name "Marie Claire Johnson" have the first name "Marie" and the middle name "Claire". Some of them have the first name "Marie Claire" and of those, many come from places where you would just know that, so they may not find it important to tell you. Oh, and if poor Marie Claire is tired of being called "Marie", she may write her name "Marie-Claire" so be prepared for hyphens. In fact, be prepared for all the same punctuation and length issues you saw with family names. Not everybody has a middle name. Some people have more than one. Some people have a middle initial but no name to go with it. Some people use their middle name and not their first name, so taking the official name in human resources may not work. Not everybody has a first name. Some people use only one name, which usually gets assigned to the "family name" slot. Some people use two initials. Some people are normally called something which has no relationship to their legal name at all.Changing Names
People, of both genders, change names (and occasionally genders!) for a variety of reasons. If you have a process where somebody shows up, you ask them for a name once, and that gets pushed to individual entries in some large number of databases which have no further co-ordination, plan what you are going to do for changes. Particularly changes caused by somebody (possibly you) getting things wrong the first time. Not that I spent four years fixing my name in various databases in college after a data-entry clerk typoed it, or anything.Additions to Names
Being able to tell Mr. John Smith Sr. from Mr. John Smith Jr. from Mr. John Smith III could save you from 10 years of shipping shareholder reports to a customer. Or it could have done so for one company my father bought from and his father held stock in. But apparently they thought those were decorative.Accented Characters
If you cannot deal with accents, be sure you don't get them as input, because Mr. Le Carré may be okay with becoming Le Carre, but will not be happy with becoming Le Carr or "invalid character". Also be aware that if your accent handling is inconsistent, you have no way of knowing whether Ms. Snör, Ms. Snoer, and Ms. Snor are three people, two people, or one person (in German they should be two, as "Snör" and "Snoer" are valid ways of writing the same name, but "Snor" isn't, but that isn't true for every language that uses ö).Conflicts and Special Conflicts
Names of people that are problematic:- Root
- Roger Oot
- Core
- True
- Test
Honorifics
You are best off letting people write in what they want, with a few options (Mr., Miss, Ms, Mrs., Dr.) presented as easy choices. If you allow people to leave it blank, do not automatically fill in a choice. I get mail for "Mr. Zwicky" and my father gets mail for "Ms. Zwicky" because of people's assumptions about what a blank means. If you do not allow people to leave it blank, provide a gender-neutral option. Do not assume honorifics translate automatically between languages. For a long time, I was Mme. Zwicky in French but Ms. Zwicky in English (because in English it's all about whether or not you're married, but in French it has to do with how grown up you are).
Version 1.9 last modified by Elizabeth Zwicky on 2008-07-23 at 18:55
Document data
Attachments:
No attachments for this document
Comments: 0