US Baby Names in 2015
For each year of birth YYYY after 1879, the Social Security Administration created a dataset in which each record has the format "name,sex,number", where "name" is 2 to 15 characters long, "sex" is M (male) or F (female), and "number" is the number of occurrences of the name. Each dataset is sorted first by sex and then by number of occurrences in descending order. When there is a tie on the number of occurrences, names are listed in alphabetical order. This sorting makes it easy to determine a name's rank: the first record for each sex has rank 1, the second record for each sex has rank 2, and so forth.
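Because the records are already sorted, a name's rank is just its position among the records for its sex. The following is a minimal sketch of that idea in Python, not an official tool: the function name `rank_names` and the sample records are made up for illustration, and the counts are not real SSA figures.

```python
import csv
import io
from collections import defaultdict


def rank_names(text):
    """Return {(name, sex): rank} for records in "name,sex,number" format.

    Assumes the records are already sorted as described above (by sex,
    then by descending count, then alphabetically), so the rank is the
    position of the record within its sex.
    """
    seen_per_sex = defaultdict(int)
    ranks = {}
    for row in csv.reader(io.StringIO(text)):
        if not row:  # skip blank lines
            continue
        name, sex, _number = row
        seen_per_sex[sex] += 1
        ranks[(name, sex)] = seen_per_sex[sex]
    return ranks


# Hypothetical sample records; Cal and Dan are tied and listed alphabetically.
sample = "Ann,F,30\nBea,F,20\nCal,M,25\nDan,M,25\nEli,M,10\n"
ranks = rank_names(sample)
print(ranks[("Dan", "M")])
```

Note that this sketch gives tied names consecutive ranks in alphabetical order, exactly as their position in the file implies.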
To safeguard privacy, we restrict our list of names to those with at least 5 occurrences. The original dataset can be found at https://www.ssa.gov/oact/babynames/limits.html
In 1998, the Social Security Administration published Actuarial Note #139, Name Distributions in the Social Security Area, August 1997, on the distribution of given names of Social Security number holders. The note, written by actuary Michael W. Shackleford, gave birth to these datasets.
All names are from Social Security card applications for births that occurred in the United States after 1879. Note that many people born before 1937 never applied for a Social Security card, so their names are not included in our data. For others who did apply, our records may not show the place of birth, and again their names are not included in our data.
People using our data on popular names are urged to explicitly acknowledge the following qualifications.
- Names are restricted to cases where the year of birth, sex, and State of birth (50 States and District of Columbia) are on record, and where the given name is at least 2 characters long.
- Name data are not edited. For example, the sex associated with a name may be incorrect. Entries such as "Unknown" and "Baby" are not removed from the lists.
- Different spellings of similar names are not combined. For example, the names Caitlin, Caitlyn, Kaitlin, Kaitlyn, Kaitlynn, Katelyn, and Katelynn are considered separate names and each has its own rank.
- When two different names are tied with the same frequency for a given year of birth, we break the tie by assigning rank in alphabetical order.
- Some names are applied to both males and females (for example, Micah). Our rankings are done by sex, so that a name such as Micah will have a different rank for males as compared to females. When you seek the popularity of a specific name (see "Popularity of a Name"), you can specify the sex. If you do not specify the sex, we provide rankings for the more popular name-sex combination.
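The last two qualifications can be sketched in code: ties are broken alphabetically, ranks are computed separately for each sex, and a lookup without a sex falls back to the more popular name-sex combination. This is only an illustration under stated assumptions. The function names `sort_key` and `popularity` and the sample counts are hypothetical, and the way the SSA's own lookup is implemented is not described in the text above.

```python
def sort_key(record):
    """Order by sex, then count descending, then name alphabetically."""
    name, sex, number = record
    return (sex, -number, name)


def popularity(records, name, sex=None):
    """Return (sex, rank) for a name, or None if it is not listed.

    If sex is None, use the name-sex combination with more occurrences.
    How to handle an exact tie between the sexes is not specified in the
    source; this sketch simply keeps the first sex in sorted order.
    """
    position = {}  # running rank counter per sex
    found = {}     # sex -> (number, rank) for the requested name
    for rec_name, rec_sex, number in sorted(records, key=sort_key):
        position[rec_sex] = position.get(rec_sex, 0) + 1
        if rec_name == name:
            found[rec_sex] = (number, position[rec_sex])
    if not found:
        return None
    if sex is None:
        sex = max(found, key=lambda s: found[s][0])
    if sex not in found:
        return None
    return sex, found[sex][1]


# Made-up counts for illustration only.
records = [
    ("Micah", "M", 40), ("Aaron", "M", 40), ("Noah", "M", 90),
    ("Micah", "F", 12), ("Emma", "F", 80),
]
print(popularity(records, "Micah"))       # more popular combination
print(popularity(records, "Micah", "F"))  # explicit sex
```

In the sample, Aaron and Micah are tied among males, so Aaron is ranked ahead of Micah by alphabetical order.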