Transformer-Based Language Model Surprisal Predicts Human Reading Times Best with About Two Billion Training Tokens. (arXiv:2304.11389v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2304.11389 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

Oct 24, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling. (arXiv:2304.09145v3 [cs.CL] UPDATED)

http://arxiv.org/abs/2304.09145 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

Oct 24, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Thorny Roses: Investigating the Dual Use Dilemma in Natural Language Processing. (arXiv:2304.08315v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2304.08315 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

Oct 24, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

MEGA: Multilingual Evaluation of Generative AI. (arXiv:2303.12528v4 [cs.CL] UPDATED)

http://arxiv.org/abs/2303.12528 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

Oct 24, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization. (arXiv:2303.12314v4 [cs.CL] UPDATED)

http://arxiv.org/abs/2303.12314 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

Oct 24, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Context-faithful Prompting for Large Language Models. (arXiv:2303.11315v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2303.11315 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

Oct 24, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!. (arXiv:2303.08559v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2303.08559 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

Oct 24, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

WiCE: Real-World Entailment for Claims in Wikipedia. (arXiv:2303.01432v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2303.01432 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

Oct 24, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

AI Chat Assistants can Improve Conversations about Divisive Topics. (arXiv:2302.07268v5 [cs.HC] UPDATED)

http://arxiv.org/abs/2302.07268 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

Oct 24, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Towards Agile Text Classifiers for Everyone. (arXiv:2302.06541v2 [cs.CL] UPDATED)

http://arxiv.org/abs/2302.06541 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

Oct 24, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning. (arXiv:2302.04858v2 [cs.CV] UPDATED)

http://arxiv.org/abs/2302.04858 #arXiv #NLProc

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

**arXiv - CSCL** @arxiv_cscl@qoto.org · Oct 24, 2023, 03:19

Oct 24, 2023, 03:19

arXiv - CSCL @arxiv_cscl@qoto.org

CodeLMSec Benchmark: Systematically Evaluating and Finding Security Vulnerabilities in Black-Box Code Language Models. (arXiv:2302.04012v2 [cs.CR] UPDATED)

http://arxiv.org/abs/2302.04012 #arXiv #NLProc