Package cattonum. R topics documented: May 2, Type Package Version Title Encode Categorical Features

Similar documents
Package fastdummies. January 8, 2018

Package knitrprogressbar

Package vinereg. August 10, 2018

Package robotstxt. November 12, 2017

Package bisect. April 16, 2018

Package SEMrushR. November 3, 2018

Package messaging. May 27, 2018

Package canvasxpress

Package ggimage. R topics documented: November 1, Title Use Image in 'ggplot2' Version 0.0.7

Package validara. October 19, 2017

Package repec. August 31, 2018

Package dkanr. July 12, 2018

Package censusr. R topics documented: June 14, Type Package Title Collect Data from the Census API Version 0.0.

Package ggimage. R topics documented: December 5, Title Use Image in 'ggplot2' Version 0.1.0

Package datasets.load

Package geojsonsf. R topics documented: January 11, Type Package Title GeoJSON to Simple Feature Converter Version 1.3.

Package ECctmc. May 1, 2018

Package splithalf. March 17, 2018

Package docxtools. July 6, 2018

Package kdtools. April 26, 2018

Package facerec. May 14, 2018

Package sfdct. August 29, 2017

Package rsppfp. November 20, 2018

Package editdata. October 7, 2017

Package githubinstall

Package ezsummary. August 29, 2016

Package TrafficBDE. March 1, 2018

Package datapasta. January 24, 2018

Package semver. January 6, 2017

Package calpassapi. August 25, 2018

Package gggenes. R topics documented: November 7, Title Draw Gene Arrow Maps in 'ggplot2' Version 0.3.2

Package qualmap. R topics documented: September 12, Type Package

Package IATScore. January 10, 2018

Package tidylpa. March 28, 2018

Package gtrendsr. October 19, 2017

Package oec. R topics documented: May 11, Type Package

Package rcv. August 11, 2017

Package fitbitscraper

Package crossword.r. January 19, 2018

Package projector. February 27, 2018

Package embed. November 19, 2018

Package statsdk. September 30, 2017

Package comparedf. February 11, 2019

Package WordR. September 7, 2017

Package infer. July 11, Type Package Title Tidy Statistical Inference Version 0.3.0

Package essurvey. August 23, 2018

Package keyholder. May 19, 2018

Package wrswor. R topics documented: February 2, Type Package

Package spark. July 21, 2017

Package jpmesh. December 4, 2017

Package vip. June 15, 2018

Package kirby21.base

Package readxl. April 18, 2017

Package goodpractice

Package tidyimpute. March 5, 2018

Package tibble. August 22, 2017

Package nngeo. September 29, 2018

Package pdfsearch. July 10, 2018

Package ecoseries. R topics documented: September 27, 2017

Package opencage. January 16, 2018

Package colf. October 9, 2017

Package postal. July 27, 2018

Package spelling. December 18, 2017

Package auctestr. November 13, 2017

Package tidytransit. March 4, 2019

Package d3plus. September 25, 2017

Package tidytree. June 13, 2018

Package rgho. R topics documented: January 18, 2017

Package gtrendsr. August 4, 2018

Package internetarchive

Package autocogs. September 22, Title Automatic Cognostic Summaries Version 0.1.1

Package dbx. July 5, 2018

Package sigqc. June 13, 2018

Package widyr. August 14, 2017

Package pwrab. R topics documented: June 6, Type Package Title Power Analysis for AB Testing Version 0.1.0

Package loggit. April 9, 2018

Package guardianapi. February 3, 2019

Package pmatch. October 19, 2018

Package data.world. April 5, 2018

Package cancensus. February 4, 2018

Package climber. R topics documented:

Package GetITRData. October 22, 2017

Package triebeard. August 29, 2016

Package censusapi. August 19, 2018

Package catenary. May 4, 2018

Package preprosim. July 26, 2016

Package bigreadr. R topics documented: August 13, Version Date Title Read Large Text Files

Package available. November 17, 2017

Package fingertipsr. May 25, Type Package Version Title Fingertips Data for Public Health

Package wikitaxa. December 21, 2017

Package liftr. R topics documented: May 14, Type Package

Package modules. July 22, 2017

Package epitab. July 4, 2018

Package textrecipes. December 18, 2018

Package barcoder. October 26, 2018

Package sfc. August 29, 2016

Package raker. October 10, 2017

Package stapler. November 27, 2017

Package virustotal. May 1, 2017

Package xtractomatic

Transcription:

Type Package Version 0.0.2 Title Encode Categorical Features Package cattonum May 2, 2018 Functions for dummy encoding, frequency encoding, label encoding, leave-one-out encoding, mean encoding, median encoding, and one-hot encoding. Maintainer Bernie Gray <bfgray3@gmail.com> URL https://github.com/bfgray3/cattonum BugReports https://github.com/bfgray3/cattonum/issues Encoding UTF-8 Depends R (>= 3.2.0) Imports dplyr (>= 0.7.0), stats, tidyselect (>= 0.2.3) LazyData true License MIT + file LICENSE Suggests covr, knitr, nycflights13, ranger, rmarkdown, that, tibble RoxygenNote 6.0.1 VignetteBuilder knitr NeedsCompilation no Author Bernie Gray [aut, cre], Mark Roepke [ctb] Repository CRAN Date/Publication 2018-05-02 03:18:21 UTC R topics documented: cattonum.......................................... 2 catto_dummy........................................ 2 catto_freq.......................................... 3 1

2 catto_dummy catto_label.......................................... 3 catto_loo.......................................... 4 catto_mean......................................... 5 catto_median........................................ 6 catto_onehot......................................... 6 Index 8 cattonum cattonum: Encode Categorical Features Functions for dummy encoding, frequency encoding, label encoding, leave-one-out encoding, mean encoding, median encoding, and one-hot encoding. catto_dummy Dummy encoding Dummy encoding catto_dummy(,...,, = TRUE) The ing data, in a data.frame or tibble. Should informative messages be printed? Defaults to TRUE (not yet used). a list with names "" and "" is returned holding the encoded ing and datasets. catto_dummy(iris)

catto_freq 3 catto_freq Frequency encoding Frequency encoding catto_freq(,...,, = TRUE) The ing data, in a data.frame or tibble. Should informative messages be printed? Defaults to TRUE (not yet used). a list with names "" and "" is returned holding the encoded ing and datasets. catto_freq(iris) catto_label Label encoding Label encoding catto_label(,...,, ordering = "increasing", = TRUE, seed = 4444)

4 catto_loo The ing data, in a data.frame or tibble. ordering How should labels be assigned to levels? There are three different ways to pass this argument. First, a length one character vector with value "increasing", "decreasing", "observed", or "random" will apply that ordering to each column being encoded. Second, a character vector of length greater than one may be passed, specifying one of the above four options for each column being encoded. Finally, a list may be passed specifying a user-defined ordering for each column being encoded. seed Should informative messages be printed? Defaults to TRUE (not yet used). The random seed set before all random ordering encodings if there are any. a list with names "" and "" is returned holding the encoded ing and datasets. catto_label(iris) y <- 2 ^ (0:5) x1 <- c("a", "b", NA, "b", "a", "a") x2 <- c("c", "c", "c", "d", "d", "c") df_fact <- data.frame(y, x1, x2) catto_label(df_fact, ordering = list(c("b", "a"), c("c", "d"))) catto_label(df_fact, ordering = c("increasing", "decreasing")) catto_loo Leave-one-out encoding Leave-one-out encoding catto_loo(,..., response,, = TRUE)

catto_mean 5 The ing data, in a data.frame or tibble. response The response variable used to calculate means. Should informative messages be printed? Defaults to TRUE. a list with names "" and "" is returned holding the encoded ing and datasets. catto_loo(iris, response = Sepal.Length) catto_mean Mean encoding Mean encoding catto_mean(,..., response,, = TRUE) The ing data, in a data.frame or tibble. response The response variable used to calculate means. Should informative messages be printed? Defaults to TRUE. a list with names "" and "" is returned holding the encoded ing and datasets. catto_mean(iris, response = Sepal.Length)

6 catto_onehot catto_median Median encoding Median encoding catto_median(,..., response,, = TRUE) The ing data, in a data.frame or tibble. response The response variable used to calculate medians. Should informative messages be printed? Defaults to TRUE. a list with names "" and "" is returned holding the encoded ing and datasets. catto_median(iris, response = Sepal.Length) catto_onehot One-hot encoding One-hot encoding catto_onehot(,...,, = TRUE)

catto_onehot 7 The ing data, in a data.frame or tibble. Should informative messages be printed? Defaults to TRUE (not yet used). a list with names "" and "" is returned holding the encoded ing and datasets. catto_onehot(iris)

Index catto_dummy, 2 catto_freq, 3 catto_label, 3 catto_loo, 4 catto_mean, 5 catto_median, 6 catto_onehot, 6 cattonum, 2 cattonum-package (cattonum), 2 8