Thanks for this package, which looks very useful. I’m interested in exploring possible outliers in a sample of a single random variable, rather than in a regression context. Would this be possible with the package? I have tried setting a formula where the dependent variable is regressed on a constant, and this seems to work in some cases, but not others (see example below). So, is this a reasonable idea for checking for outliers of a single random variable, when using the package? If so, of the methods the package provides, are there recommendations for this usage case? Thanks!
When you set a formula using y ~ 1, a regression model of y = constant + epsilon is estimated and it is still a regression model. I think it is more convenient to use single variable tools in that situation. smr98 is based on a cluster analysis on standardized tuples (yhat, residuals), I think the problem is all of the yhat values are the same in your example (or something else that I can’t cover at first sight). Some of the other algorithms will work in the univariate case but I wouldn’t say this is a recommended method.
Thanks. I realized that I needed to read more about the methods, rather than just use them without checking suitability of a method for this usage case.
After 2 years since the first post, we got much attention to the package. As a re-announcement we are happy to revise our package to v0.8.16, with the latest implementation of the Quantile Regression estimator. Here is the latest list of the algorithms & estimators:
Ordinary Least Squares, Weighted Least Squares, Basic diagnostics