Comparing Machines and Children: Using Developmental Psychology Experiments to Assess the Strengths and Weaknesses of LaMDA Responses. (arXiv:2305.11243v1 [cs.CL])
Comparing Biases and the Impact of Multilingual Training across Multiple Languages. (arXiv:2305.11242v1 [cs.CL])
Recent Trends in Unsupervised Summarization. (arXiv:2305.11231v1 [cs.CL])
LIMA: Less Is More for Alignment. (arXiv:2305.11206v1 [cs.CL])
Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt. (arXiv:2305.11186v1 [cs.CL])
Large-Scale Text Analysis Using Generative Language Models: A Case Study in Discovering Public Value Expressions in AI Patents. (arXiv:2305.10383v2 [cs.CL] UPDATED)
FACE: Evaluating Natural Language Generation with Fourier Analysis of Cross-Entropy. (arXiv:2305.10307v2 [cs.CL] UPDATED)
MemoryBank: Enhancing Large Language Models with Long-Term Memory. (arXiv:2305.10250v2 [cs.CL] UPDATED)
EfficientSCI: Densely Connected Network with Space-time Factorization for Large-scale Video Snapshot Compressive Imaging. (arXiv:2305.10006v2 [cs.CV] UPDATED)
"I'm fully who I am": Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation. (arXiv:2305.09941v2 [cs.CL] UPDATED)
Epsilon Sampling Rocks: Investigating Sampling Strategies for Minimum Bayes Risk Decoding for Machine Translation. (arXiv:2305.09860v2 [cs.CL] UPDATED)
DarkBERT: A Language Model for the Dark Side of the Internet. (arXiv:2305.08596v2 [cs.CL] UPDATED)
NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference Checklist. (arXiv:2305.08566v3 [cs.CL] UPDATED)
A Hierarchical Encoding-Decoding Scheme for Abstractive Multi-document Summarization. (arXiv:2305.08503v3 [cs.CL] UPDATED)
Parameter-Efficient Fine-Tuning with Layer Pruning on Free-Text Sequence-to-Sequence Modeling. (arXiv:2305.08285v2 [cs.CL] UPDATED)
Learning to Generalize for Cross-domain QA. (arXiv:2305.08208v2 [cs.CL] UPDATED)
Self-supervised Neural Factor Analysis for Disentangling Utterance-level Speech Representations. (arXiv:2305.08099v2 [cs.SD] UPDATED)
GPT-Sentinel: Distinguishing Human and ChatGPT Generated Content. (arXiv:2305.07969v2 [cs.CL] UPDATED)
Dr. LLaMA: Improving Small Language Models on PubMedQA via Generative Data Augmentation. (arXiv:2305.07804v2 [cs.CL] UPDATED)
Instance Smoothed Contrastive Learning for Unsupervised Sentence Embedding. (arXiv:2305.07424v2 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl