Show Your Work with Confidence: Confidence Bands for Tuning Curves. (arXiv:2311.09480v1 [cs.CL]) 

ARES: An Automated Evaluation Framework for Retrieval-Augmented Generation Systems. (arXiv:2311.09476v1 [cs.CL]) 

JAB: Joint Adversarial Prompting and Belief Augmentation. (arXiv:2311.09473v1 [cs.AI]) 

Clarify When Necessary: Resolving Ambiguity Through Interaction with LMs. (arXiv:2311.09469v1 [cs.CL]) 

Think While You Write: Hypothesis Verification Promotes Faithful Knowledge-to-Text Generation. (arXiv:2311.09467v1 [cs.CL]) 

Lexical Repetitions Lead to Rote Learning: Unveiling the Impact of Lexical Overlap in Train and Test Reference Summaries. (arXiv:2311.09458v1 [cs.CL]) 

How Trustworthy are Open-Source LLMs? An Assessment under Malicious Demonstrations Shows their Vulnerabilities. (arXiv:2311.09447v1 [cs.CL]) 

Subtle Misogyny Detection and Mitigation: An Expert-Annotated Dataset. (arXiv:2311.09443v1 [cs.CL]) 

Labeled Interactive Topic Models. (arXiv:2311.09438v1 [cs.LG]) 

Backdoor Activation Attack: Attack Large Language Models using Activation Steering for Safety-Alignment. (arXiv:2311.09433v1 [cs.CR]) 

Striped Attention: Faster Ring Attention for Causal Transformers. (arXiv:2311.09431v1 [cs.LG]) 

Beyond Detection: Unveiling Fairness Vulnerabilities in Abusive Language Models. (arXiv:2311.09428v1 [cs.CL]) 

Predicting generalization performance with correctness discriminators. (arXiv:2311.09422v1 [cs.CL]) 

When Large Language Models contradict humans? Large Language Models' Sycophantic Behaviour. (arXiv:2311.09410v1 [cs.CL]) 

Alternatives to the Scaled Dot Product for Attention in the Transformer Neural Network Architecture. (arXiv:2311.09406v1 [cs.LG]) 

To Translate or Not to Translate: A Systematic Investigation of Translation-Based Cross-Lingual Transfer to Low-Resource Languages. (arXiv:2311.09404v1 [cs.CL]) 

LEEETs-Dial: Linguistic Entrainment in End-to-End Task-oriented Dialogue systems. (arXiv:2311.09390v1 [cs.CL]) 

Neural machine translation for automated feedback on children's early-stage writing. (arXiv:2311.09389v1 [cs.CL]) 

Banach-Tarski Embeddings and Transformers. (arXiv:2311.09387v1 [cs.LG]) 

Long-form Question Answering: An Iterative Planning-Retrieval-Generation Approach. (arXiv:2311.09383v1 [cs.CL]) 
