Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations. (arXiv:2305.14599v2 [cs.CL] UPDATED)
FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models. (arXiv:2305.14481v2 [cs.CL] UPDATED)
ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models. (arXiv:2305.14323v3 [cs.CL] UPDATED)
LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages. (arXiv:2305.14263v2 [cs.CL] UPDATED)
Fine-tuned LLMs Know More, Hallucinate Less with Few-Shot Sequence-to-Sequence Semantic Parsing over Wikidata. (arXiv:2305.14202v2 [cs.CL] UPDATED)
Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought. (arXiv:2305.13903v2 [cs.CL] UPDATED)
Continually Improving Extractive QA via Human Feedback. (arXiv:2305.12473v2 [cs.CL] UPDATED)
Causal Document-Grounded Dialogue Pre-training. (arXiv:2305.10927v3 [cs.CL] UPDATED)
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models. (arXiv:2305.08322v3 [cs.CL] UPDATED)
RECKONING: Reasoning through Dynamic Knowledge Encoding. (arXiv:2305.06349v3 [cs.CL] UPDATED)
Semantic Space Grounded Weighted Decoding for Multi-Attribute Controllable Dialogue Generation. (arXiv:2305.02820v2 [cs.CL] UPDATED)
Approximating CKY with Transformers. (arXiv:2305.02386v2 [cs.CL] UPDATED)
Safety Analysis in the Era of Large Language Models: A Case Study of STPA using ChatGPT. (arXiv:2304.01246v2 [cs.CL] UPDATED)
Sentiment Analysis Dataset in Moroccan Dialect: Bridging the Gap Between Arabic and Latin Scripted dialect. (arXiv:2303.15987v2 [cs.CL] UPDATED)
Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding. (arXiv:2303.12513v2 [cs.CV] UPDATED)
Can an Embodied Agent Find Your "Cat-shaped Mug"? LLM-Guided Exploration for Zero-Shot Object Navigation. (arXiv:2303.03480v2 [cs.RO] UPDATED)
xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval. (arXiv:2303.03004v4 [cs.CL] UPDATED)
AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages. (arXiv:2302.08956v5 [cs.CL] UPDATED)
Evaluating Neuron Interpretation Methods of NLP Models. (arXiv:2301.12608v2 [cs.CL] UPDATED)
Dissociating language and thought in large language models. (arXiv:2301.06627v2 [cs.CL] UPDATED)
All recent Computation and Language articles on arXiv.org for the Fediverse
Inspired by https://twitter.com/arxiv_cscl