RECALL: A Benchmark for LLMs Robustness against External Counterfactual Knowledge. (arXiv:2311.08147v1 [cs.CL])
Sinkhorn Transformations for Single-Query Postprocessing in Text-Video Retrieval. (arXiv:2311.08143v1 [cs.CL])
Memory-efficient Stochastic methods for Memory-based Transformers. (arXiv:2311.08123v1 [cs.LG])
Insights into Classifying and Mitigating LLMs' Hallucinations. (arXiv:2311.08117v1 [cs.CL])
Improving hateful memes detection via learning hatefulness-aware embedding space through retrieval-guided contrastive learning. (arXiv:2311.08110v1 [cs.CL])
SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training with Adversarial Remarks. (arXiv:2311.08107v1 [cs.CL])
Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models. (arXiv:2311.08106v1 [cs.CL])
DiLoCo: Distributed Low-Communication Training of Language Models. (arXiv:2311.08105v1 [cs.LG])
Exploring Semi-supervised Hierarchical Stacked Encoder for Legal Judgement Prediction. (arXiv:2311.08103v1 [cs.CL])
Empowering Multi-step Reasoning across Languages via Tree-of-Thoughts. (arXiv:2311.08097v1 [cs.CL])
Spot: A Natural Language Interface for Geospatial Searches in OSM. (arXiv:2311.08093v1 [cs.CL])
Align after Pre-train: Improving Multilingual Generative Models with Cross-lingual Alignment. (arXiv:2311.08089v1 [cs.CL])
Data and models for stance and premise detection in COVID-19 tweets: insights from the Social Media Mining for Health (SMM4H) 2022 shared task. (arXiv:2311.08057v1 [cs.CL])
Adversarial Preference Optimization. (arXiv:2311.08045v1 [cs.CL])
Forgetting before Learning: Utilizing Parametric Arithmetic for Knowledge Updating in Large Language Models. (arXiv:2311.08011v1 [cs.CL])
Distantly-Supervised Named Entity Recognition with Uncertainty-aware Teacher Learning and Student-student Collaborative Learning. (arXiv:2311.08010v1 [cs.CL])
TempTabQA: Temporal Question Answering for Semi-Structured Tables. (arXiv:2311.08002v1 [cs.CL])
A Comparative Analysis of the COVID-19 Infodemic in English and Chinese: Insights from Social Media Textual Data. (arXiv:2311.08001v1 [cs.SI])
How Well Do Text Embedding Models Understand Syntax?. (arXiv:2311.07996v1 [cs.CL])
A Survey on Language Models for Code. (arXiv:2311.07989v1 [cs.CL])
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl