@djhsu f must assign contiguous intervals of X to {1,...,K}? Suppose d=1 and we assign X < 10^6 to k=1, X in (k-1)e6 to ke6 to k for k=2,...,10 and X>1e7 to k=11. Then mu0 is approx 0 (though slightly less), all other mu_k should be > (k-1)e6, so the average squared mu_k would be 25e12 which is > log 11. Does that check out? What am I missing. Must be something obvious.
@djhsu oh is the average weighted by interval probability?
@kristinmbranson Yes, the average is weighted by Pr(f(X)=k) rather than 1/K for all k.
@djhsu OK post-shower Kristin buys this, and thinks it would be nice if the log base is e :). What are the implications of this? When should I be arbitrarily quantizing my data? This should hold for any function f: X^d - > [K], correct?
@djhsu I see, so quantizing my Gaussian data, or more generally further quantizing beyond my GMM modes, won't give me much representational power. Interesting! Thanks for sharing!! (Sorry, just found the custom emojis...)
@kristinmbranson I agree, but I forgot the constant :-) Yes, it puts a limit on how well you can quantize "unstructured" data, with any f: not much better than the trivial quantizer (mu=0). Can contrast this to what happens with structured data (e.g., mixture of K Gaussians).