Another thing that would help all of this is having a more reasonable describe command. The current behavior prints information for each column one at a time, rather than a single table that summarizes everything. A more readable format would really help this type of debugging.
How do people feel about having describe return a dataframe or similar tabular object, rather than simply dumping the results of describe for each column?
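
To make the proposal concrete, here is a rough sketch of the shape I have in mind, written in plain Python so it is not tied to any particular library. Everything in it is hypothetical: the names summarize_columns and format_table, the input layout (a mapping from column name to a list of values, with None for missing), and the choice of statistics are assumptions for illustration, not a description of how describe works today.

```python
from statistics import mean, median


def summarize_columns(columns):
    """Return one summary row per column instead of printing each column separately.

    `columns` is a hypothetical mapping of column name -> list of values,
    where None marks a missing value.
    """
    rows = []
    for name, values in columns.items():
        present = [v for v in values if v is not None]
        # Only compute numeric statistics when every present value is a number.
        numeric = bool(present) and all(
            isinstance(v, (int, float)) and not isinstance(v, bool) for v in present
        )
        rows.append({
            "column": name,
            "n_missing": len(values) - len(present),
            "n_unique": len(set(present)),
            "min": min(present) if numeric else None,
            "mean": mean(present) if numeric else None,
            "median": median(present) if numeric else None,
            "max": max(present) if numeric else None,
        })
    return rows


def format_table(rows):
    """Render the summary rows as a single aligned plain-text table."""
    if not rows:
        return ""
    headers = list(rows[0].keys())
    # Statistics that do not apply to a column are shown as empty cells.
    cells = [["" if r[h] is None else str(r[h]) for h in headers] for r in rows]
    widths = [max(len(h), *(len(c[i]) for c in cells)) for i, h in enumerate(headers)]
    lines = ["  ".join(h.ljust(w) for h, w in zip(headers, widths))]
    lines += ["  ".join(v.ljust(w) for v, w in zip(c, widths)) for c in cells]
    return "\n".join(lines)


# Example input: one numeric column and one text column, each with a missing value.
data = {"age": [31, 45, None, 22], "city": ["Oslo", "Lima", "Oslo", None]}
summary = summarize_columns(data)
print(format_table(summary))
```

The point of returning the rows rather than only printing them is that the summary can then be filtered or sorted like any other table, for example to list the columns with the most missing values first.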