Show newer

Is ChatGPT a Good Causal Reasoner? A Comprehensive Evaluation. (arXiv:2305.07375v3 [cs.CL] UPDATED) 

A Framework for Designing Foundation Model based Systems. (arXiv:2305.05352v2 [cs.SE] UPDATED) 

Distilling Script Knowledge from Large Language Models for Constrained Language Planning. (arXiv:2305.05252v3 [cs.CL] UPDATED) 

Augmented Large Language Models with Parametric Knowledge Guiding. (arXiv:2305.04757v2 [cs.CL] UPDATED) 

Unified Model Learning for Various Neural Machine Translation. (arXiv:2305.02777v2 [cs.CL] UPDATED) 

Unlimiformer: Long-Range Transformers with Unlimited Length Input. (arXiv:2305.01625v2 [cs.CL] UPDATED) 

Nondeterministic Stacks in Neural Networks. (arXiv:2304.12955v2 [cs.CL] UPDATED) 

Eyettention: An Attention-based Dual-Sequence Model for Predicting Human Scanpaths during Reading. (arXiv:2304.10784v2 [cs.CL] UPDATED) 

Multi-step Jailbreaking Privacy Attacks on ChatGPT. (arXiv:2304.05197v2 [cs.CL] UPDATED) 

Sociocultural knowledge is needed for selection of shots in hate speech detection tasks. (arXiv:2304.01890v4 [cs.CL] UPDATED) 

Mastering Symbolic Operations: Augmenting Language Models with Compiled Neural Networks. (arXiv:2304.01665v2 [cs.CL] UPDATED) 

The Role of Semantic Parsing in Understanding Procedural Text. (arXiv:2302.06829v2 [cs.CL] UPDATED) 

Unifying Molecular and Textual Representations via Multi-task Language Modelling. (arXiv:2301.12586v2 [cs.LG] UPDATED) 

Case-Based Reasoning with Language Models for Classification of Logical Fallacies. (arXiv:2301.11879v2 [cs.AI] UPDATED) 

Domain-Agnostic Molecular Generation with Self-feedback. (arXiv:2301.11259v3 [cs.LG] UPDATED) 

ClarifyDelphi: Reinforced Clarification Questions with Defeasibility Rewards for Social and Moral Situations. (arXiv:2212.10409v2 [cs.CL] UPDATED) 

Gradient-based Intra-attention Pruning on Pre-trained Language Models. (arXiv:2212.07634v2 [cs.CL] UPDATED) 

From Clozing to Comprehending: Retrofitting Pre-trained Masked Language Model to Pre-trained Machine Reader. (arXiv:2212.04755v2 [cs.CL] UPDATED) 

DC-MBR: Distributional Cooling for Minimum Bayesian Risk Decoding. (arXiv:2212.04205v2 [cs.CL] UPDATED) 

Distilling Reasoning Capabilities into Smaller Language Models. (arXiv:2212.00193v2 [cs.LG] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.