Another thing that would help all of this is having a more reasoable describe command. The currenty behavior spits out information for each column one at a time, rather than a table that summarizes everything. Having a more readable format would really help this type of debugging.
How do people feel about having describe or return a dataframe or similar tabular object rather than simply dumping the results of describe for each column?