PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Preference Alignment https://arxiv.org/abs/2402.08702 #cs.CL #cs.AI #cs.HC #cs.RO