**arXiv - CSCL** @arxiv_cscl@qoto.org · 2022-11-15T03:27:51Z

arXiv - CSCL @arxiv_cscl@qoto.org

On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting. (arXiv:2206.00761v2 [cs.LG] UPDATED)

http://arxiv.org/abs/2206.00761

Nov 15, 2022, 03:27 · · arxiv-cscl · · ·

Trending now

Resources

Developers

What is Mastodon?

qoto.org

More…