Towards Interpretable and Efficient Automatic Reference-Based Summarization Evaluation. (arXiv:2303.03608v2 [cs.CL] UPDATED)
AmQA: Amharic Question Answering Dataset. (arXiv:2303.03290v2 [cs.CL] UPDATED)
EvoPrompting: Language Models for Code-Level Neural Architecture Search. (arXiv:2302.14838v3 [cs.NE] UPDATED)
Making first order linear logic a generating grammar. (arXiv:2206.08955v5 [cs.CL] UPDATED)
Generative AI for Hate Speech Detection: Evaluation and Findings. (arXiv:2311.09993v1 [cs.CL])
Unambiguity and Fewness for Nonuniform Families of Polynomial-Size Nondeterministic Finite Automata. (arXiv:2311.09979v1 [cs.FL])
Hijacking Large Language Models via Adversarial In-Context Learning. (arXiv:2311.09948v1 [cs.LG])
An Attention-Based Denoising Framework for Personality Detection in Social Media Texts. (arXiv:2311.09945v1 [cs.CY])
Language Generation from Human Brain Activities. (arXiv:2311.09889v1 [cs.CL])
Which Modality should I use -- Text, Motif, or Image? : Understanding Graphs with Large Language Models. (arXiv:2311.09862v1 [cs.CL])
PsyBench: a balanced and in-depth Psychological Chinese Evaluation Benchmark for Foundation Models. (arXiv:2311.09861v1 [cs.CL])
GSAP-NER: A Novel Task, Corpus, and Baseline for Scholarly Entity Extraction Focused on Machine Learning Models and Datasets. (arXiv:2311.09860v1 [cs.CL])
Leveraging LLMs in Scholarly Knowledge Graph Question Answering. (arXiv:2311.09841v1 [cs.CL])
PELMS: Pre-training for Effective Low-Shot Multi-Document Summarization. (arXiv:2311.09836v1 [cs.CL])
ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks. (arXiv:2311.09835v1 [cs.CL])
Overview of the HASOC Subtrack at FIRE 2023: Identification of Tokens Contributing to Explicit Hate in English by Span Detection. (arXiv:2311.09834v1 [cs.CL])
X-Mark: Towards Lossless Watermarking Through Lexical Redundancy. (arXiv:2311.09832v1 [cs.CL])
AutoPlanBench: : Automatically generating benchmarks for LLM planners from PDDL. (arXiv:2311.09830v1 [cs.AI])
FollowEval: A Multi-Dimensional Benchmark for Assessing the Instruction-Following Capability of Large Language Models. (arXiv:2311.09829v1 [cs.CL])
AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages. (arXiv:2311.09828v1 [cs.CL])
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl