Which GPU should I ask for?

What problems are you facing with using 16 bit computations in Flux? As I understand, everything should be generic wrt the floating point type, but the default is 32 bits.