Show newer

On the Exploitability of Reinforcement Learning with Human Feedback for Large Language Models. (arXiv:2311.09641v1 [cs.AI]) 

Evaluating In-Context Learning of Libraries for Code Generation. (arXiv:2311.09635v1 [cs.CL]) 

Online Continual Knowledge Learning for Language Models. (arXiv:2311.09632v1 [cs.CL]) 

From Scroll to Misbelief: Modeling the Unobservable Susceptibility to Misinformation on Social Media. (arXiv:2311.09630v1 [cs.CL]) 

CRISPR: Eliminating Bias Neurons from an Instruction-following Language Model. (arXiv:2311.09627v1 [cs.AI]) 

Take One Step at a Time to Know Incremental Utility of Demonstration: An Analysis on Reranking for Few-Shot In-Context Learning. (arXiv:2311.09619v1 [cs.CL]) 

Simulating Opinion Dynamics with Networks of LLM-based Agents. (arXiv:2311.09618v1 [physics.soc-ph]) 

On Retrieval Augmentation and the Limitations of Language Model Training. (arXiv:2311.09615v1 [cs.CL]) 

Digital Socrates: Evaluating LLMs through explanation critiques. (arXiv:2311.09613v1 [cs.CL]) 

Efficient End-to-End Visual Document Understanding with Rationale Distillation. (arXiv:2311.09612v1 [cs.CV]) 

GistScore: Learning Better Representations for In-Context Example Selection with Gist Bottlenecks. (arXiv:2311.09606v1 [cs.CL]) 

Measuring and Improving Attentiveness to Partial Inputs with Counterfactuals. (arXiv:2311.09605v1 [cs.CL]) 

SCORE: A framework for Self-Contradictory Reasoning Evaluation. (arXiv:2311.09603v1 [cs.CL]) 

Language Models (Mostly) Do Not Consider Emotion Triggers When Predicting Emotion. (arXiv:2311.09602v1 [cs.CL]) 

Multi-Step Dialogue Workflow Action Prediction. (arXiv:2311.09593v1 [cs.CL]) 

LifeTox: Unveiling Implicit Toxicity in Life Advice. (arXiv:2311.09585v1 [cs.CL]) 

Enhancing Medical Text Evaluation with GPT-4. (arXiv:2311.09581v1 [cs.CL]) 

MMOE: Mixture of Multimodal Interaction Experts. (arXiv:2311.09580v1 [cs.CL]) 

Crafting In-context Examples according to LMs' Parametric Knowledge. (arXiv:2311.09579v1 [cs.CL]) 

Tied-Lora: Enhacing parameter efficiency of LoRA with weight tying. (arXiv:2311.09578v1 [cs.CL]) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.