Why are missing values not ignored by default?

Can anyone point to a publicly available (maybe govt produced) moderate size dataset with missing values and some suggested analyses that could be done with them so we can have a kind of “playground”?

I can probably think of some with some research but maybe someone has a really good example along the lines of a manageable version of @pdeffebach’s comment:

Maybe not 1000+ variables, but something with a few tens of thousands of rows and a hundred ish columns and liberal use of missing?

1 Like