Non-call expression encountered

Pino · November 23, 2022, 6:44pm

New to Julia, coming from MATLAB.

I’m trying to run a regression using a dataframe, but I want to use a specific range of its columns as predictors because I have many covariates.

So instead of running the following

df_test = DataFrame(A = rand(Int, 100), B = rand(Int, 100), C = rand(0:1, 100) )

model_test = glm(@formula(C ~ A + B+C),
 df_test, Binomial(), LogitLink())

I would like to do something like this (in wrong syntax reminiscent of MATLAB):

model_test = glm(@formula(C ~ A + df_test[:,2:end]),
 df_test, Binomial(), LogitLink())

bertschi · November 23, 2022, 8:45pm

Macroexpanding @formula shows that it just creates calls to ~, +, etc. on Term objects wrapping the column symbols, i.e.,

julia> @macroexpand @formula C ~ A + B
:(StatsModels.Term(:C) ~ StatsModels.Term(:A) + StatsModels.Term(:B))

Thus, we can construct the desired formula directly using ordinary function calls:

julia> using StatsModels

julia> my_formula = Term(:C) ~ +( (Term(Symbol(x)) for x ∈ names(df_test)[2:end])...)
FormulaTerm
Response:
  C(unknown)
Predictors:
  B(unknown)
  C(unknown)

julia> model_test = glm(my_formula, df_test, Binomial(), LogitLink())
StatsModels.TableRegressionModel{GeneralizedLinearModel{GLM.GlmResp{Vector{Float64}, Binomial{Float64}, LogitLink}, GLM.DensePredChol{Float64, LinearAlgebra.Cholesky{Float64, Matrix{Float64}}}}, Matrix{Float64}}

C ~ 1 + B + C

Alternatively, you can just pass the data as a design matrix and a target vector directly:

julia> model_test = glm(Matrix(df_test[:, 2:end]), df_test[:, :C], Binomial(), LogitLink())
GeneralizedLinearModel{GLM.GlmResp{Vector{Float64}, Binomial{Float64}, LogitLink}, GLM.DensePredChol{Float64, LinearAlgebra.Cholesky{Float64, Matrix{Float64}}}}:

In any case, you probably want 1:end-1 instead of 2:end, as otherwise C is regressed on itself.
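
A minimal sketch of the design-matrix route with the corrected column range. The data here are made up for illustration (randn predictors rather than rand(Int, ...)), and the explicit column of ones is my addition: as far as I know, the matrix method of glm does not add an intercept on its own, unlike the formula method.

```julia
using DataFrames, GLM

# Illustrative data only: two numeric predictors and a 0/1 response in the last column
df = DataFrame(A = randn(100), B = randn(100), C = rand(0:1, 100))

# All columns except the last one (the response), plus an explicit intercept column
X = hcat(ones(nrow(df)), Matrix(df[:, 1:end-1]))
y = Float64.(df.C)

model = glm(X, y, Binomial(), LogitLink())
```

The coefficients then come back in column order: intercept first, followed by A and B.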

nilshg · November 23, 2022, 11:22pm

A shorter way to build the right-hand side is sum(term.(names(df)[2:end])) (or, if you want to make the column selection a bit more robust, use names(df[:, Not(:C)]) for the names).
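
Put together, this might look like the sketch below. The data and the variable names (predictors, rhs, f) are just for illustration, and names(df, Not(:C)) assumes a reasonably recent DataFrames.jl.

```julia
using DataFrames, GLM, StatsModels

# Illustrative data only: two numeric predictors and a 0/1 response
df = DataFrame(A = randn(100), B = randn(100), C = rand(0:1, 100))

# Names of every column except the response
predictors = names(df, Not(:C))

# term.(...) wraps each name in a Term; sum joins the terms with +
rhs = sum(term.(predictors))
f = term(:C) ~ rhs

model = glm(f, df, Binomial(), LogitLink())
```

Selecting by name with Not(:C) keeps working if the response column is moved, which a positional range like 1:end-1 does not.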
