Towards Being Parameter-Efficient: A Stratified Sparsely Activated Transformer with Dynamic Capacity. (arXiv:2305.02176v2 [cs.CL] UPDATED)
GPT-RE: In-context Learning for Relation Extraction using Large Language Models. (arXiv:2305.02105v2 [cs.CL] UPDATED)
Summarizing Multiple Documents with Conversational Structure for Meta-Review Generation. (arXiv:2305.01498v4 [cs.CL] UPDATED)
Prompt as Triggers for Backdoor Attack: Examining the Vulnerability in Language Models. (arXiv:2305.01219v5 [cs.CL] UPDATED)
Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4. (arXiv:2305.00118v2 [cs.CL] UPDATED)
Transformer-Based Language Model Surprisal Predicts Human Reading Times Best with About Two Billion Training Tokens. (arXiv:2304.11389v2 [cs.CL] UPDATED)
Outlier Suppression+: Accurate quantization of large language models by equivalent and optimal shifting and scaling. (arXiv:2304.09145v3 [cs.CL] UPDATED)
Thorny Roses: Investigating the Dual Use Dilemma in Natural Language Processing. (arXiv:2304.08315v2 [cs.CL] UPDATED)
MEGA: Multilingual Evaluation of Generative AI. (arXiv:2303.12528v4 [cs.CL] UPDATED)
Self-supervised Meta-Prompt Learning with Meta-Gradient Regularization for Few-shot Generalization. (arXiv:2303.12314v4 [cs.CL] UPDATED)
Context-faithful Prompting for Large Language Models. (arXiv:2303.11315v2 [cs.CL] UPDATED)
Large Language Model Is Not a Good Few-shot Information Extractor, but a Good Reranker for Hard Samples!. (arXiv:2303.08559v2 [cs.CL] UPDATED)
WiCE: Real-World Entailment for Claims in Wikipedia. (arXiv:2303.01432v2 [cs.CL] UPDATED)
AI Chat Assistants can Improve Conversations about Divisive Topics. (arXiv:2302.07268v5 [cs.HC] UPDATED)
Towards Agile Text Classifiers for Everyone. (arXiv:2302.06541v2 [cs.CL] UPDATED)
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning. (arXiv:2302.04858v2 [cs.CV] UPDATED)
CodeLMSec Benchmark: Systematically Evaluating and Finding Security Vulnerabilities in Black-Box Code Language Models. (arXiv:2302.04012v2 [cs.CR] UPDATED)
Concept Algebra for Score-Based Conditional Models. (arXiv:2302.03693v3 [cs.CL] UPDATED)
Using In-Context Learning to Improve Dialogue Safety. (arXiv:2302.00871v3 [cs.CL] UPDATED)
Knowledge Distillation $\approx$ Label Smoothing: Fact or Fallacy?. (arXiv:2301.12609v3 [cs.LG] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl