Thanks, this is all really useful perspective! The analogy to one-hot encoding makes perfect sense.
And yes, I was definitely planning on a “modular” featurization scheme. I’m keen to explore which atomic features actually matter in the first place, and to play with binning densities and other meta- and hyperparameters.
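
To make the idea concrete, here is a rough sketch of what a modular, one-hot-binned featurizer could look like. It is only an illustration under assumptions: the helper names (`one_hot_bin`, `linear_edges`, `make_featurizer`), the choice of features, and the bin ranges are all hypothetical and not taken from any particular library.

```python
from typing import Callable, Dict, List, Sequence


def one_hot_bin(value: float, edges: Sequence[float]) -> List[int]:
    """One-hot encode `value` into len(edges) + 1 bins split at ascending `edges`."""
    index = sum(value >= e for e in edges)  # count of edges at or below the value
    vec = [0] * (len(edges) + 1)
    vec[index] = 1
    return vec


def linear_edges(lo: float, hi: float, n_bins: int) -> List[float]:
    """Interior edges for n_bins equal-width bins on [lo, hi]."""
    step = (hi - lo) / n_bins
    return [lo + step * i for i in range(1, n_bins)]


def make_featurizer(
    specs: Dict[str, Sequence[float]]
) -> Callable[[Dict[str, float]], List[int]]:
    """Build a featurizer from {feature name: bin edges}.

    Adding or dropping a feature, or changing its binning density,
    only means editing `specs`.
    """
    def featurize(atom: Dict[str, float]) -> List[int]:
        vec: List[int] = []
        for name, edges in specs.items():
            vec.extend(one_hot_bin(atom[name], edges))
        return vec

    return featurize


# Hypothetical configuration: two atomic features, four bins each.
specs = {
    "electronegativity": linear_edges(0.5, 4.0, 4),
    "period": [1.5, 2.5, 3.5],
}
featurize = make_featurizer(specs)

# Illustrative input for carbon (Pauling electronegativity 2.55, period 2).
carbon = {"electronegativity": 2.55, "period": 2}
vector = featurize(carbon)
```

Because each feature is just an entry in `specs`, testing which features matter or how fine the bins should be comes down to building a different `specs` dictionary and rerunning.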