Single variable least absolute deviation (LAD) linear regression

pauljurczak · March 2, 2019, 9:04am

I’m looking for a simple to use single variable least absolute deviation linear regression library. My problem is modeled by a single variable function with small set (less than 32) of observations. I want to get a linear approximation of it using least absolute deviation.

Paul_Soderlind · March 2, 2019, 9:24am

something like this perhaps?

(T,K) = (size(x,1),size(x,2))

  b_old = zeros(K)
  b     = fill(1e+6,K) .+ prec
  wii   = ones(T)
  while maximum(abs.(b - b_old)) > prec
    b_old = copy(b)
    xw    = x.*wii
    b     = (xw'x)\(xw'y)                     #solve (xw'x)b=xw'y for b
    u     = y - x*b
    wii   = min.(1.0./abs.(u),1/epsu)    #Amemiya p 78, since 1/abs(u) can be infinite
  end

  yhat = x*b                          #fitted values
  u    = y - yhat

  Sxx  = x'x/T                             #Sum{xt)*x(t)'}/T
  h    = 1.06*std(u)/T^0.2                 #kernel density value at u=0
  Kh   = exp.(-0.5*((u/h).^2))/sqrt(2*pi)  #gaussian kernel, with "std" = 1
  f0   = mean(Kh)/h
  Covb = f0^(-2)*inv(Sxx)/(4*T)            #Cov(b)

pauljurczak · March 2, 2019, 9:54am

Thanks, I will give it a try. What are the dependencies?

Paul_Soderlind · March 2, 2019, 10:32am

you need to do using Statistics (for the mean and std calculations). It’s one of the standard libraries, so no installation is required.

Before running, the code you need to define y (vector with dependent variable), x (matrix of regressors), prec (perhaps 1e-5) and epsu (perhaps 1e-6)

pauljurczak · March 2, 2019, 11:13am

What’s matrix of regressors? I’m not familiar with statistical lingo. Let’s say my observations are 1->3, 2->4, 3->7. Do I define:

x = [1, 2, 3]
y = [3, 4, 7]

How do I get predicted values?

Paul_Soderlind · March 2, 2019, 12:07pm

The code wants x to be a matrix, so do x = [1 2 3]'. Then you find the fitted values in yhat

jlperla · March 2, 2019, 3:25pm

It may be overkill here, but is there a succinct way to do it with Flux?

pauljurczak · March 2, 2019, 10:32pm

I have a bit of a problem with the results. For:

x = [1 2 3 4]'
y = [0 1 0 1]'

I’m getting yhat:

4×1 Array{Float64,2}:
 0.24999448434228871
 0.49998896868457743
 0.7499834530268661 
 0.9999779373691549

where sum(abs.(yhat-y)) (LAD) is 1.5000110313154225. But

y0 = [0 1/3 2/3 1]'

provides much better approximation with LAD of 1.3333333333333335

Paul_Soderlind · March 3, 2019, 10:16am

y0 = [0 1/3 2/3 1]’

is what you get when you also include a constant among the regressors. You get this by changing the matrix of regressors to

pauljurczak · March 3, 2019, 11:19am

Thank you. It works now.

Topic		Replies	Views
Efficient way of doing linear regression Performance regression	44	20593	February 7, 2022
Linear Regression step by step for newcomers Machine Learning question	7	4996	November 2, 2022
Best way for linear regression problem on product features General Usage performance , regression , qr	28	298	April 11, 2025
Linear Fit New to Julia curve-fitting	8	4843	August 3, 2022
The simplest linear fit with GLM Tooling glm	13	5284	November 11, 2021

Single variable least absolute deviation (LAD) linear regression

Related topics