glm {SparkR} | R Documentation |
Fits a generalized linear model, similarly to R's glm().
glm(formula, family = gaussian, data, weights, subset, na.action,
    start = NULL, etastart, mustart, offset, control = list(...),
    model = TRUE, method = "glm.fit", x = FALSE, y = TRUE,
    contrasts = NULL, ...)

## S4 method for signature 'formula,ANY,SparkDataFrame'
glm(formula, family = gaussian, data, epsilon = 1e-06, maxit = 25)
formula | A symbolic description of the model to be fitted. Currently only a few formula operators are supported, including '~', '.', ':', '+', and '-'.
family | A description of the error distribution and link function to be used in the model. This can be a character string naming a family function, a family function, or the result of a call to a family function. See the R family documentation at https://stat.ethz.ch/R-manual/R-devel/library/stats/html/family.html.
data | SparkDataFrame for training.
epsilon | Positive convergence tolerance of iterations.
maxit | Integer giving the maximal number of IRLS iterations.
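For orientation, the iteration that epsilon and maxit control can be written in its standard textbook form. This is a sketch of generic IRLS (iteratively reweighted least squares), not a statement of how SparkR implements it. The symbols (design matrix X, link g, variance function V, deviance D) are notation introduced here. The stopping rule shown is the one documented for R's own glm.control; whether the SparkDataFrame method applies the same rule is an assumption.

```latex
% One IRLS step at iteration t, with \eta^{(t)} = X\beta^{(t)} and \mu^{(t)} = g^{-1}(\eta^{(t)})
\begin{aligned}
z^{(t)} &= \eta^{(t)} + \bigl(y - \mu^{(t)}\bigr)\, g'\!\bigl(\mu^{(t)}\bigr) \\
W^{(t)} &= \operatorname{diag}\!\left( \frac{1}{V\bigl(\mu^{(t)}\bigr)\, g'\!\bigl(\mu^{(t)}\bigr)^{2}} \right) \\
\beta^{(t+1)} &= \bigl(X^{\top} W^{(t)} X\bigr)^{-1} X^{\top} W^{(t)} z^{(t)}
\end{aligned}

% Stopping rule used by R's glm.control (assumed, not confirmed, for SparkR):
\frac{\bigl|D^{(t+1)} - D^{(t)}\bigr|}{\bigl|D^{(t+1)}\bigr| + 0.1} < \texttt{epsilon}
\qquad \text{or} \qquad t = \texttt{maxit}
```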
a fitted generalized linear model
## Not run: 
sc <- sparkR.init()
sqlContext <- sparkRSQL.init(sc)
data(iris)
df <- createDataFrame(sqlContext, iris)
model <- glm(Sepal_Length ~ Sepal_Width, df, family = "gaussian")
summary(model)

## End(Not run)
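The sketch below shows how the formula operators, the family argument, and the epsilon and maxit arguments described above might be combined. It needs a working Spark installation and assumes the same session setup as the example above is available in your SparkR version. The object names (gaussianModel, interactionModel, training, binomialModel) and the tolerance and iteration values are illustrative choices, not recommendations.

```r
library(SparkR)

# Start SparkR the same way the example above does (assumed to be available).
sc <- sparkR.init()
sqlContext <- sparkRSQL.init(sc)

# createDataFrame() replaces the dots in the iris column names with
# underscores, so Sepal.Length becomes Sepal_Length.
df <- createDataFrame(sqlContext, iris)

# '.' uses every other column as a predictor; '-' removes one of them.
gaussianModel <- glm(Sepal_Length ~ . - Species, data = df,
                     family = "gaussian")
summary(gaussianModel)

# ':' adds an interaction term between two predictors.
interactionModel <- glm(Sepal_Length ~ Sepal_Width + Petal_Length +
                          Sepal_Width:Petal_Length,
                        data = df, family = "gaussian")
summary(interactionModel)

# A binomial family needs a two-level response, so keep two of the
# three species before fitting.
training <- filter(df, df$Species != "setosa")

# Tighter tolerance and a higher iteration cap than the defaults
# (epsilon = 1e-06, maxit = 25); both values are arbitrary here.
binomialModel <- glm(Species ~ Sepal_Length + Sepal_Width,
                     data = training, family = "binomial",
                     epsilon = 1e-08, maxit = 50)
summary(binomialModel)
```

Naming data, family, epsilon, and maxit explicitly avoids relying on positional matching, which differs between the generic signature and the SparkDataFrame method.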