Skip to contents

The following is an R data package that features certain data sets from the Machine Learning Library at UC Irvine. These data sets have been cleaned up and provide documentation via R’s help system.

[!NOTE]

Want to easily access data sets not included in this package?

Check out the {ucimlrepo} R package! The package provides an interface to download and automatically load data sets from the UC Irvine Machine Learning Repository.

Installation

You can install ucidata from github with:

# install.packages("remotes")
remotes::install_github("coatless-rpkg/ucidata")

Using data in the package

There are two ways to access the data contained within this package.

The first is to load the package itself and type the name of a data set. This approach takes advantage of R’s lazy loading mechansim, which avoids loading the data until it is used in R session. For details on how lazy loading works, please see Section 1.17: Lazy Loading of the R Internals manual.

# Load the `ucidata` package
library("ucidata")

# See the first 10 observations of the `autompg` dataset
head(autompg)

# View the help documentation for `autompg`
?autompg

The second approach is to use the data() command to load data on the fly without and type the name of a data set.

# Loading `autompg` without a `library(ucidata)` call
data("autompg", package = "ucidata")

# See the first 10 observations of the `autompg` dataset
head(autompg)

# View the help documentation for `autompg`
?autompg

Included Data Sets

The following data sets are included in the ucidata package:

Build Scripts

Want to see how each data set was imported? Check out the data-raw folder!