I'm also looking for examples of (deceptively) simple programming tasks that genAI doesn't quite get right. Something along the lines of:
Write a function that does <simple task>.
In most cases, the output you'd get is either plainly wrong or kind of works, but not quite.