Samples arrive periodically as Dr. Wolberg reports his clinical cases. The database therefore reflects this chronological grouping of the data.
Format
A data frame with 699 observations on the following 11 variables.
sample_code_number
: id number
clump_thickness
: 1 - 10
uniformity_of_cell_size
: 1 - 10
uniformity_of_cell_shape
: 1 - 10
single_epithelial_cell_size
: 1 - 10
bare_nuclei
: 1 - 10
bland_chromatin
: 1 - 10
normal_nucleoli
: 1 - 10
mitoses
: 1 - 10
class
: 2 for benign, 4 for malignant
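Because class is stored as the numeric codes 2 and 4, it is often convenient to recode it as a labelled factor before tabulating or modelling. The following is a minimal sketch only: the data frame `toy` and its values are made up for illustration, and the new column name `diagnosis` is a hypothetical choice, not part of the dataset.

```r
# Toy rows using column names from the Format section.
# The values are invented for illustration; they are not taken from the data.
toy <- data.frame(
  sample_code_number = c(1L, 2L, 3L),
  clump_thickness    = c(5L, 3L, 8L),
  class              = c(2L, 2L, 4L)
)

# Recode the numeric class codes (2 = benign, 4 = malignant) as a factor.
# 'diagnosis' is a hypothetical column name chosen for this sketch.
toy$diagnosis <- factor(toy$class,
                        levels = c(2, 4),
                        labels = c("benign", "malignant"))

# Count cases per diagnosis.
table(toy$diagnosis)
```

The same recoding should apply unchanged to the full data frame, provided its class column uses only the codes 2 and 4 as documented.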
Details
This grouping information appears immediately below, having been removed from the data itself:
Group | Instances | Date of Collection |
1 | 367 | January 1989 |
2 | 70 | October 1989 |
3 | 31 | February 1990 |
4 | 17 | April 1990 |
5 | 48 | August 1990 |
6 | 49 | Updated January 1991 |
7 | 31 | June 1991 |
8 | 86 | November 1991 |
Total | 699 points | 15 July 1992 |
Note that the results summarized above in Past Usage refer to a dataset of size 369, while Group 1 has only 367 instances. This is because it originally contained 369 instances; 2 were removed.
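The group sizes in the table can be checked against the stated total, and, if needed, used to rebuild a group label for each row. This is a sketch under a stated assumption: it supposes the rows of the data frame are stored in order of collection, which the description suggests but this page does not confirm. The names `group_sizes` and `group` are hypothetical.

```r
# Instances per group, copied from the table above.
group_sizes <- c(367, 70, 31, 17, 48, 49, 31, 86)
names(group_sizes) <- seq_along(group_sizes)

# The sizes add up to the documented total of 699 observations.
sum(group_sizes)

# ASSUMPTION: rows are stored in chronological order, so the first 367 rows
# belong to group 1, the next 70 to group 2, and so on.
group <- rep(seq_along(group_sizes), times = group_sizes)

# One label per observation; tabulating recovers the sizes in the table.
length(group)
table(group)
```

If the row order of the data frame differs from the order of collection, the reconstructed labels will not correspond to the true groups.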