Show newer

Bridging Continuous and Discrete Spaces: Interpretable Sentence Representation Learning via Compositional Operations. (arXiv:2305.14599v2 [cs.CL] UPDATED) 

FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models. (arXiv:2305.14481v2 [cs.CL] UPDATED) 

ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models. (arXiv:2305.14323v3 [cs.CL] UPDATED) 

LIMIT: Language Identification, Misidentification, and Translation using Hierarchical Models in 350+ Languages. (arXiv:2305.14263v2 [cs.CL] UPDATED) 

Fine-tuned LLMs Know More, Hallucinate Less with Few-Shot Sequence-to-Sequence Semantic Parsing over Wikidata. (arXiv:2305.14202v2 [cs.CL] UPDATED) 

Let's Think Frame by Frame with VIP: A Video Infilling and Prediction Dataset for Evaluating Video Chain-of-Thought. (arXiv:2305.13903v2 [cs.CL] UPDATED) 

Continually Improving Extractive QA via Human Feedback. (arXiv:2305.12473v2 [cs.CL] UPDATED) 

Causal Document-Grounded Dialogue Pre-training. (arXiv:2305.10927v3 [cs.CL] UPDATED) 

C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models. (arXiv:2305.08322v3 [cs.CL] UPDATED) 

RECKONING: Reasoning through Dynamic Knowledge Encoding. (arXiv:2305.06349v3 [cs.CL] UPDATED) 

Semantic Space Grounded Weighted Decoding for Multi-Attribute Controllable Dialogue Generation. (arXiv:2305.02820v2 [cs.CL] UPDATED) 

Approximating CKY with Transformers. (arXiv:2305.02386v2 [cs.CL] UPDATED) 

Safety Analysis in the Era of Large Language Models: A Case Study of STPA using ChatGPT. (arXiv:2304.01246v2 [cs.CL] UPDATED) 

Sentiment Analysis Dataset in Moroccan Dialect: Bridging the Gap Between Arabic and Latin Scripted dialect. (arXiv:2303.15987v2 [cs.CL] UPDATED) 

Is BERT Blind? Exploring the Effect of Vision-and-Language Pretraining on Visual Language Understanding. (arXiv:2303.12513v2 [cs.CV] UPDATED) 

Can an Embodied Agent Find Your "Cat-shaped Mug"? LLM-Guided Exploration for Zero-Shot Object Navigation. (arXiv:2303.03480v2 [cs.RO] UPDATED) 

xCodeEval: A Large Scale Multilingual Multitask Benchmark for Code Understanding, Generation, Translation and Retrieval. (arXiv:2303.03004v4 [cs.CL] UPDATED) 

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages. (arXiv:2302.08956v5 [cs.CL] UPDATED) 

Evaluating Neuron Interpretation Methods of NLP Models. (arXiv:2301.12608v2 [cs.CL] UPDATED) 

Dissociating language and thought in large language models. (arXiv:2301.06627v2 [cs.CL] UPDATED) 

Show older
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.